VERBAL COMPREHENSION AND COMMUNICATION
IN NEGRO AND WHITE CHILDREN¹


ARNOLD S. CARSON AND A. I. RABIN²
Michigan State University 


Comparisons of intellectual functioning 
in Negroes and whites have been of con- 
cern to research workers for some time. 
The classical review and summary by 
Klineberg (1935) is not yet outdated. For 
the most part, the superiority of the whites, 
especially on verbal tasks, has been demon- 
strated. However, the improvement in in- 
telligence test scores of Negro children 
who migrated to the North from Southern 
states has also been shown. Moreover, the 
degree of improvement tends to be re- 
lated to duration of residence in the North. 

Performance tasks which correlate highly 
with “general intelligence”, such as the 
Goodenough Draw-A-Man IQ Test, have 
not yielded significant differences between 
Negro and white children (Anastasi & 
D’Angelo, 1952). It would appear that 
tasks requiring verbal comprehension and 
verbal expression are the ones that are 
to a large extent responsible for the sig- 
nificant differences between the groups. 

¹ Based on master’s thesis (Carson, 1959) submitted to the Department
of Psychology, Michigan State University by A. S. Carson and
supervised by A. I. Rabin.

² The authors are grateful to G. F. King and C. F. Wrighley for their
valuable assistance and suggestions.

The present study is concerned with the investigation of these two
functions, comprehension and expression or communication, in Negro
and white children. We propose to study these functions separately,
and in relation to each other, in Southern and Northern Negro, and in
Northern white, children. Our prediction was, in
consonance with previous related findings, 
that when groups are matched for age, 
grade placement, sex, and level of verbal 
comprehension, the white children will be 
superior to the Southern and Northern 
Negroes in verbal communication; also, 
that the Northern Negro children will be 
superior to the Southern Negro children 
on measures of verbal communication. 


METHOD 


The Full Range Picture Vocabulary 
Test (Ammons & Ammons, 1949) was 
selected as the method of determining the 
level of verbal comprehension. This test 
requires no verbal response from the sub- 
jects (S). It consists of a series of cards 
with several pictures on each card. The S 
is merely required to point to the specific 
picture each time the examiner reads a 
word from the vocabulary test. 

Two measures of verbal communication 
were employed for comparative purposes. 
First, the WISC Vocabulary (Wechsler,
1949) was presented in the standard man- 
ner and the responses scored as pro- 
vided in the manual. Second, the words of 
the Full-Range Picture Vocabulary Test 
(FRPVT) which were “defined” by gestic- 
ulation were presented as a conventional 
vocabulary test, requiring oral definition. 

The conventional scoring of vocabulary 
tests is an “all or none” affair. No provi- 
sions are made for qualitative differences 
in definitions or in levels of abstraction, 




precision and communication. This lack of 
sensitivity in the conventional scoring of 
vocabulary tests has been pointed out long 
ago by Yacorzynski (1940). 

In order to provide for scoring of levels 
of communication with the vocabulary of 
the FRPVT an adaptation of a method re- 
ported in the literature (Rabin, King, & 
Ehrmann, 1955) was utilized. A descrip- 
tion of this Qualitative Vocabulary Scale, 
of the categories of responses and defini- 
tions of the word “wagon” as illustrative 
of the levels of communication appear be- 
low. 


Class 1. Categorization and Synonym 


a. Categorization. The categorization response classifies the
stimulus word by some definite scheme in terms of its universal
characteristics.


Response—“a vehicle” 


b. Synonym. The synonym response may 
essentially be used to replace the object or 
idea with no or little change in the denotative 
aspects of the stimulus word. 


Response—“a cart” 


Class 2. Essential Description. An essen- 
tial description response must give the char- 
acterizing features of the stimulus word. If 
the stimulus word is abstract, the response 
must create mental imagery of the relevant 
situation. If the stimulus word is concrete 
(physically tangible), the response must dif- 
ferentiate between the stimulus word and 
members in its class. 


Response—“It’s a wooden thing with 4 
wheels, and it looks like a box.”


Class 3. Essential Function. An essential
function response must describe primary 
rather than peripheral usage or purpose of 
an object or an idea. 


Response—“You ride in it out West.” 


Class 4. Example. An example response 
defines an object or idea in terms of its as- 
pects or members. 


Response—“There’s a red wagon 
kids play with.” 


Class 5. Vague Description and Vague Function


a. Vague Description. A vague description
is a response that is not totally irrelevant
but does not give the characterizing features 
of the object or idea. 


Response—“Something that has four wheels.” 


b. Vague Function. A vague function 
response describes the peripheral rather than 
primary usage or purpose of the object or the 
idea.

Response—“It bumps into people.” 

Class 6. Error. The error response is totally 
irrelevant to the stimulus word. 

Response—“The dog ‘wagons’ his tail.” 

Class 7. Don’t Know. A “don’t know” response is a statement, or a
lack of statement, indicating that the S is unable to define verbally
a word that he had previously recognized by a gesticulation
response.


Response—“Don’t know.”


Subjects

The Ss were Southern Negro (SN), Northern Negro (NN), and Northern
white (NW) children in the fourth, fifth, and sixth grades. None of
the Ss were advanced or behind in their age-grade placement. The SN
group comprised all of the Negro children within the age range of
9.5 to 11.5 years who were born and reared in the South and had
migrated to Lansing, Michigan within 28 months of the date tested.
All of the NW and NN children were born and reared in Ingham County,
Michigan. Data were collected on the occupational levels of the NW
and NN children in accordance with the Dictionary of Occupational
Titles (United States Government Printing Office, 1944)
classification. Two of the main wage earners in the NW children’s
families were professional, technical or managerial workers, 4 were
clerical or sales workers, 3 were service employees, 9 were
mechanical workers, and 12 were manual laborers; respectively, there
were 0, 5, 2, 5, and 18 NN main wage earners. It may be noted that
the occupational levels of main wage earners from the two samples
are similar. Due to the recent relocation of the SN families, the
main wage earners’ vocations were unsettled and therefore
unavailable as data.

Children were tested with the FRPVT at random from the NW and NN
groups in order to acquire Ss whose scores could be individually
matched with the scores of children in the SN group. Table 1
summarizes relevant data for the three groups of children.


TABLE 1
GRADE, SEX, AGE, AND MEAN PICTURE VOCABULARY TEST SCORES
OF THE THREE GROUPS

                                       Ages (Months)     Raw Score of FRPVT
Group   N    Grade          Sex        Mean     SD       Mean      SD
NW      30   16 / 8 / 6     16 / 14    128      10.9     43.67     4.95
NN      30   16 / 8 / 6     15 / 15    125      10.9     43.57     5.74
SN      30   16 / 8 / 6     14 / 16    [illegible]       43.53     5.21


PROCEDURE 


The FRPVT and the two oral vocabu- 
lary devices were administered individually 
to the Ss by the same examiner in single 
testing sessions. The SN children were 
tested first followed by random testing of 
NN and NW children who would qualify 
for the respective samples. When each SN 
child was matched with a NW and NN 
child in terms of the FRPVT scores, the 
testing was discontinued. 

Two independent scorers tallied verbal 
responses to the FRPVT. These responses 
were scored according to the classification 
system described above. The overall proportion of interscorer
agreement was .79. Complete agreement between the two scorers was
obtained after the debatable responses were discussed.


RESULTS AND DISCUSSION


As a preliminary step in the treatment 
of the data, all Ss were classified into 
“high” and “low” responders. The Ss whose 
majority of responses were tallied in cate- 
gories 1, 2 and 3 were placed in the former 
classification; the predominance of the re- 
maining four categories characterized the 
second group. Table 2 reports the inci- 
dence of members of each sample in the 
“high” and “low” categories. The differences in the distribution
are statistically significant.
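
As a rough check on the test reported in Table 2, the sketch below computes a Pearson chi-square from the high and low counts. It is an illustration under stated assumptions, not the authors' computation: the SN "low" count of 23 is inferred from the group size of 30, and the function name `chi_square` is hypothetical. Under these assumptions the statistic comes out near 22.2 with 2 degrees of freedom, close to but not identical with the tabled 22.54; the source does not say how the authors arrived at their figure.

```python
# Counts of "high" and "low" responders per group, as read from Table 2.
# Assumption: the SN "low" count is 23, inferred from the group size of 30.
observed = {
    "NW": (25, 5),
    "NN": (18, 12),
    "SN": (7, 23),
}


def chi_square(table):
    """Pearson chi-square for an r x c table given as {label: (counts...)}."""
    rows = list(table.values())
    row_totals = [sum(row) for row in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    grand_total = sum(row_totals)

    stat = 0.0
    for row, row_total in zip(rows, row_totals):
        for obs, col_total in zip(row, col_totals):
            # Expected count under independence of group and response level.
            expected = row_total * col_total / grand_total
            stat += (obs - expected) ** 2 / expected

    dof = (len(rows) - 1) * (len(col_totals) - 1)
    return stat, dof


chi2, dof = chi_square(observed)
print(f"chi-square = {chi2:.3f}, df = {dof}")
```
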

As predicted, the white children assume 
the top position. It is also interesting to note that the NN group
places in an intermediate position between the white and SN
children.

The responses of the three groups were 
also compared on each of the seven cate- 
gories comprising our qualitative (levels of 
communication) scoring system. Table 3 
summarizes the levels of significance of the 
differences between the groups computed 
by means of the Wilcoxon matched-pairs 
signed-rank test (Siegel, 1956). Thirteen 
of the 21 comparisons were significant in 
the predicted direction. The table indicates 
that the general prediction is supported 
to a considerable degree. This is especially 
true with the extremes of the qualitative 
categories. 

The results of the study demonstrated 
that NW children, of comparable non- 
verbal word recognition (or comprehen- 
sion) abilities, manifest higher levels of 
verbal communication than NN children, 
and, in turn, the NN children manifest 
higher levels of verbal communication than 
SN children. 

All comparisons by means of the WISC 
vocabulary subtest yield significant dif- 
ferences in the predicted direction (using 
Wilcoxon’s test mentioned above). 
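
The "theoretical" mean and SD that accompany the rank totals for the WISC comparisons follow from the large-sample normal approximation to the Wilcoxon signed-rank statistic. The sketch below applies that approximation to the NW vs. SN rank totals (432.0 and 3.0). The number of non-tied pairs, 29, is an assumption inferred from the two totals summing to 435 = 29 × 30 / 2, and the helper name is hypothetical. The mean (217.5) and SD (about 46.25) agree with the tabled values; the resulting z is about 4.64, slightly above the tabled 4.62, and the source does not say whether a correction was applied.

```python
import math


def wilcoxon_normal_approx(rank_sum, n_pairs):
    """Normal approximation for the Wilcoxon matched-pairs signed-rank test.

    rank_sum: sum of ranks for one direction of difference.
    n_pairs:  number of pairs with a non-zero difference.
    """
    mean = n_pairs * (n_pairs + 1) / 4
    sd = math.sqrt(n_pairs * (n_pairs + 1) * (2 * n_pairs + 1) / 24)
    z = (rank_sum - mean) / sd
    return mean, sd, z


# Assumption: 29 non-tied pairs, since 432.0 + 3.0 = 435 = 29 * 30 / 2.
mean, sd, z = wilcoxon_normal_approx(rank_sum=432.0, n_pairs=29)
print(f"mean = {mean:.1f}, SD = {sd:.2f}, z = {z:.2f}")
```
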

Again, global vocabulary scores, as in 
the case of the more refined treatment of 
the FRPVT, indicate the superiority of 
the white children compared with the two 
Negro samples. Also, the NN group is inferior to the NW but superior
to the SN children. This finding is consistent with the results in
the previous two tables.


TABLE 2
A COMPARISON OF THE INCIDENCE OF HIGH AND LOW RESPONDERS ON THE
QUALITATIVE VOCABULARY SCALE IN THE THREE GROUPS

Group    High    Low     χ²       p¹
NW        25       5
NN        18      12
SN         7      23    22.54    <.001

¹ One-tailed test.


TABLE 3
SUMMARY OF LEVELS OF SIGNIFICANCE OBTAINED FROM COMPARISONS OF RANKS
ON THE QUALITATIVE VOCABULARY SCALE

                 NW vs. SN                        NW vs. NN                          NN vs. SN
Class    NW      SN      z      p         NW      NN             z      p         NN      SN      z             p
1       271.0    29.0   3.44   <.0005    289.0    62.0          2.87   <.0025    181.0    72.0   1.76          <.05
2       327.5   107.5   2.36   <.01      283.5   122.5          1.82   <.05      206.5   171.5   [illegible]   ns
3       350.0    85.0   2.85   <.0005    233.0   118.0          1.45   ns        326.0   139.0   2.91          <.05
4       274.5   160.5   1.22   ns        126.5   198.5           .82   ns        301.0   105.0   2.22          <.025¹
5       154.5   223.5    .82   ns        147.0   231.0          1.00   ns        202.5   203.5    .00          ns
6        34.5   316.5   3.57   <.0005     57.5   [illegible]    2.43   <.01       55.5   295.5   3.04          [illegible]
7         3.0   432.0   4.63   <.0001     95.5   229.5          1.79   <.05       40.5   394.5   3.82          <.008

¹ If a two-tailed test were employed, the difference would have been
significant in a direction contrary to the current hypothesis.


TABLE 4
SUMMARY OF COMPARISONS OF RANKS ON THE WISC VOCABULARY SUBTEST

                    Total Ranks             Theoretical
Comparison      NW       NN       SN       Mean      SD       z             p
NW vs. SN      432.0     --       3.0     217.5    46.25    4.62          <.0001
NN vs. SN       --      345.5    60.5     203.0    43.92    3.24          <.001
NW vs. NN      334.0    101.0     --      217.5    46.25    [illegible]   [illegible]


The children’s pattern of responding is interesting in that it was
unique for each of the groups and may therefore be a clue to
differences in characteristic thinking among the three groups. The
NW children favor the higher levels of verbal communication
(Classes 1, 2, 3). Their percentage of responses decreases sharply
in the Example classification and, after a slight increase in the
Vague classification, their percentage of responses continues to
decrease lower than that of the two Negro groups in the Error and
Don’t Know classifications. NN children favor the Vague type
response and manifest a greater percentage of Error type responses
than the white children. The SN children’s pattern is the only one
which manifests a high peak in the Don’t Know classification.

The differences among the groups cannot be consistently explained
according to one theoretical viewpoint. Differences between the NW
and NN children could lend support to the contention that
constitutional racial differences in intelligence exist or to the
viewpoint that a difference of cultural opportunities accounts for
the variance between racial groups. However, there are sufficient
possibilities of differences in the cultural milieu of the two
samples to question their equivalence of social opportunities. The
two groups remain segregated from each other in activities other
than those which revolve around school. The NN sample, for the most
part, lives in a section of town which is a homogeneous Negro
settlement. Their cultural milieu is definitely not characteristic
of white middle class modes of living.

The comparisons of the NN and SN children lend themselves to a more
clear-cut interpretation since the cultural variable of
geographical residence was considered in the experimental design.
It is contended that the superior educational environment and the
greater opportunity for cultural advancement of the NN child over
the SN child account for the significant differences between the
two groups.
The present study supports Klineberg’s (1947) research findings
that geographical residency of the Negro child is an important
determinant of vocabulary performance. Coppinger and Ammons’ (1952)
contention that different norms should be utilized when making
intergroup comparisons involving Negroes where the members of the
groups have different cultural backgrounds is also supported.

Finally, it should be pointed out that verbal comprehension and
verbal communication seem to be two different functions. Although
the groups were equated on comprehension, they showed marked
differences with respect to communication. The former task requires
recognition within a certain context, which is quite different from
verbal communication and definition of a word in isolation, and not
in context. It would appear that differences between Negroes and
whites on conventional intelligence tests, and especially on
vocabulary subtests, may be primarily due to failure in verbal
communication rather than in comprehension.
SUMMARY

Three groups (30 in each group) of NW, NN, and SN school children,
matched for age, sex, grade placement, and level of verbal
comprehension, were compared on two vocabulary tests requiring
verbal communication. In accord with the original prediction, the
white children were superior to the Negro children, and the NN
children were superior to the SN children on these two measures.
The results were discussed in relation to the possible racial and
cultural geographic factors involved.


THE SOCIAL DESIRABILITY VARIABLE AND VERBAL
LEARNING PERFORMANCE¹


PHILIP K. BROWN²
Sonoma State Hospital, Eldridge, California 


This article presents a study of the re- 
lation between social desirability as a 
psychological variable and learning per- 
formance and recall of college students on 
a complex verbal learning task. 

As developed by Edwards (1953a), the 
concept of social desirability has two as- 
pects. It may be considered as a property 
of the items in a personality scale such 
as the Edwards Personal Preference Sched- 
ule (EPPS). For example, it seemed ob- 
vious that a statement such as “I like to 
be loyal to my friends” reflected a more 
socially desirable attitude, and would be 
rated higher on a dimension of social de- 
sirability, than a statement such as “I like 
to avoid my responsibilities and obliga- 
tions.” Edwards (1953b) has shown that 
subjects tend to endorse the statements of 
higher scale value as characteristic of 
themselves more often than the state- 
ments rated low on the social desirability 
dimension. It follows that the social de- 
sirability variable is influential in affecting 
test responses and would need to be con- 
trolled in research using personality scales. 

¹ An extended report of this study may be obtained from the Library
of Congress, Card No. Mic. 58-1074, for a fee of $2.00, Univer.
Microfilms, 313 N. First St., Ann Arbor, Michigan.

² This study was based on a doctoral dissertation completed at the
University of Washington in 1957. The author wishes to express his
thanks to C. R. Strother and the other members of his advisory
committee for their help.

The second aspect of social desirability appeared as an individual
differences variable that reflected the tendency of a person to
endorse the more socially desirable items as characteristic of
himself. As such it may be considered a property of the person.
Edwards developed the Social Desirability Scale (the SDS) to
measure the tendency of a person to “try to look good” in a social
situation. The extent to which this second aspect of social
desirability influenced test behavior was reported by Fordyce
(1956) who found that the SDS correlated at a significantly high
level with every clinical scale of the MMPI. The SDS accounted for
the major portion of the variance of the clinical scales, leaving a
minor portion of the variance to be accounted for by the pathology
of the person or by other determinants. If the SDS measures an
important determinant of test behavior, would it also show a
relationship to such behavior as is found in a learning task? This
study addressed itself to this question.

There has been a dearth of studies relating the SDS to learning
performance. However, the SDS was found to have a very high
negative correlation with the Taylor Manifest Anxiety Scale (the A
scale) (Edwards, 1957), so that it was felt that results obtained
from learning studies which related the A scale to learning
behavior could be applied, with caution, to the SDS. Taylor (1951)
first related the A scale to eyelid conditioning, Lucas (1952) to
serial learning, Ramond (1953) to complexity of verbal material,
and others showed a significant relationship to exist between the A
scale and performance.

Some investigators used personality scales that correlated highly
with the A scale, but obtained results that were inconsistent with
the findings of the previous investigators (e.g., Lazarus, Deese,
& Hamilton, 1954; Heilizer, Axelrod, & Cowen, 1954). Such
disagreement constituted part of the justification for making this
study.
Though Taylor and Spence (1952) considered the A scale to measure
“drive level” and used it to predict behavior within a Hullian
framework, Edwards preferred to think of the SDS as measuring the
tendency to “try to look good” in a social situation. Edwards’
actual definition of the meaning of social desirability is
operationally determined by the score obtained and the correlation
of the SDS with other personality scales and measures. However,
this informal description has been used by him and serves our
purpose well.


STATEMENT OF THE PROBLEM


The aims of this study were to explore the relationship between
learning performance and (a) the social desirability rating of
statements, (b) need for social desirability as a personality
variable, and (c) the interaction of social desirability, learning
performance, and sex.


PROCEDURE


The Ss in this experiment were required to memorize 16 statements
paired to nonsense syllables. Learning proceeded by the modified
paired-associates method in which each item learned to the
criterion of exact recall on two successive trials was removed.
Each trial consisted of the presentation of all unlearned items.
The S’s success or failure was recorded each time an item was
presented, and this record represented a detailed account of his
learning performance. The Ss were also asked to recall the entire
set of 16 statements, starting one minute after completion of
learning, to provide a score for immediate recall. The Ss returned
after a 24-hour delay and attempted recall again to provide a
delayed recall score. This completed their task.


Selection of Items


The items used in the learning task were taken from the 140
statements found in the EPPS. The range of scale values for the
statements varied from 0.60 for the statement rated least socially
desirable to a scale value of 4.14 for the highest rated item. Each
list learned by an S was composed of eight items from the high end
and eight items from the low end of the SDS. The low scale value
items (Low S-V) had a mean scale value of 1.49 as compared to a
mean scale value of 3.44 for the items of high scale value (High
S-V). The number of words varied in the statements between 8 and
16, with a mode of eleven words. There was no significant
difference in mean number of words between the High S-V and Low S-V
items. The 16 items were typed on 3 X 5 cards.

A list of 16 four-letter nonsense syllables was obtained from
Woodworth (1938), modified slightly for use as stimulus syllables,
and paired with the statements.


Selection of Subjects


Ss were students from beginning classes in psychology at the
University of Washington. Students in these classes were given the
39-item SDS and were asked to volunteer for a research project.
Some pressure to cooperate existed, since one of the course
requirements was to participate in some ongoing research. Final
selection of Ss was made from among those who had volunteered and
whose SD score fell at the extremes of the class distribution of SD
scores. About one-fourth of the 600 students who took the SDS
volunteered. No significant difference in mean SD score was found
between the volunteering and nonvolunteering students.

Four experimental groups of 10 Ss each were formed as follows: High
SD males, Low SD males, High SD females, and Low SD females. The
range and means of SD scores and Ss’ age are shown in Table 1. The
mean age of the males is somewhat higher than that of the females
due to the presence of several veterans among the male groups.


Experimental Method 


Each S was contacted by phone and an 
appointment made for individual testing. 
TABLE 1
RANGE AND MEANS OF SD SCORES AND AGE OF Ss

                           SD                  Age
Subjects (N = 10)      Range    Mean      Rangeᵃ    Mean
High SD males          35-39    36.3      18-32     21.8
Low SD males           22-29    25.8      18-28     21.0
High SD females        32-39    36.4      18-20     18.5
Low SD females         22-28    24.9      18-21     19.6

ᵃ The males ranged higher in age due to the presence of several
veterans in each male group.


When the S arrived at the designated room 
he was seated at a desk facing the E and a
brief explanation of the purpose of the 
study was made as follows: 


We all know that intelligence isn’t the
only thing that’s important in learning. Some
bright people can’t seem to learn, and some
not-so-bright people are pretty good in
school. We’ve found that certain psychologi-
cal variables seem to be important in learn-
ing and remembering. We’re trying to find
out more about these variables.


The S was then shown a card containing
a list of nonsense syllables while the E
explained the purpose of using them as 
stimuli in a paired-associates learning task, 
and the S’s responsibility in the experi- 
ment. 

On the first two trials, the E read the 
nonsense syllable aloud and showed the 
statement paired with it to the S. The S 
repeated the nonsense syllable aloud and 
read the statement aloud immediately 
afterward. The cards were shuffled be- 
tween trials to eliminate the possibility of 
serial learning. Ten seconds per card was 
found adequate during the first two trials 
to allow the S time to read it. A 20-second 
interval was used between trials to allow 
the E time to shuffle the pack. Each card 
was presented to the S by the E placing it
on the table in front of him. Some varia- 
tion in the length of exposure occurred for 
the different cards since it took longer to 
read some cards than others. 
Before starting the third trial, the S was asked to guess how many
trials it would take him to complete the learning task. His
estimate (henceforth, the self confidence score) was taken as a
measure of the level of self confidence with which he undertook the
task. Starting with the third trial, the S was told:


I will say the nonsense syllable aloud and allow you five seconds
to recall the statement. If you don’t recall it, I will show it to
you, and you may read it aloud, and we will continue as before.


Learning then proceeded by the modified paired-associates learning
method until all cards had been eliminated. When the task was
completed E engaged the S in small talk for one minute and then
asked him to undertake recall of the list. The E gave the nonsense
syllable and the S attempted recall of the statement paired with
it. The statements were recalled in the same order in which they
had been learned in order to correct in a small way the effect of
temporal contiguity of the last learned items. All statements that
were not exactly recalled were written down verbatim so that such
items of doubtful precision of recall could be later evaluated by
independent judges and accepted or rejected.

Upon completion of immediate recall the S was instructed to return
the next day at the same hour for the second half of the
experiment. No other information was given him.

At the second session, the S was given a few moments to become
acclimated. He was then informed that the E wanted to see how much
of yesterday’s learning he had retained. The nonsense syllables
were given in random order. Again, all statements of doubtful
precision of recall were recorded verbatim by the E.

Thus the S’s success or failure on each item presentation was
recorded, and provided a score representing the number of trials
necessary to master each item. The number of items recalled at the
end of the learning task was taken as the score for immediate
recall, and the number of items recalled after 24 hours was taken
as the score for delayed recall.

The items of doubtful precision of recall in both immediate and
delayed recall were made up into a list in which each doubtful item
was paired with its exact statement. The pairs were reviewed by
four graduate psychology students acting as judges. They were
required to judge whether a doubtful item was sufficiently like its
exact statement so that the S could be given credit for it. The
criterion of acceptance by the E was that three of the four judges
agree it should be accepted. There was a total of 278 instances to
be judged, and of these only 29.1% met the E’s criterion. The
average agreement between judges was 80.8%.


RESULTS


Table 2 shows the means and sigmas of the self confidence scores
for the four experimental groups and gives the t tests of
significance between the High SD vs. Low SD males and the High SD
vs. the Low SD females. No comparisons are shown between the sexes,
or between the High SD Ss vs. the Low SD Ss, as no significant
differences between means were found for these groupings.


TABLE 2
MEANS, SIGMAS, AND t TESTS FOR SELF-CONFIDENCE SCORES
(N = 10 per group)

Group                Mean     Sigma     t Testsᵃ
High SD males        19.3     12.7
                                         0.02
Low SD males         19.4      9.8
High SD females       9.0      5.1
                                         2.19
Low SD females       29.6     29.3

ᵃ Since the raw score means and sigmas were proportional for the
two female groups, logarithmic transformations were employed. This
technique resulted in a t = 3.15, which was significant at the .01
level. The tabled value above, based on raw scores, was significant
at the .05 level.


Logarithmic transformations to correct for proportionality between
means and sigmas shown in Table 2 resulted in improving the
significance of the mean difference between the two female groups
from the .05 level to the .01 level. The self-confidence scores of
the High SD females, as estimated by their guessing how many trials
it would take to master the learning task, ranged between 5-20
trials. The Low SD females’ self-confidence scores ranged between
9-100 trials, with half the Low SD females estimating it would take
them 20 or more trials to master the learning task. The two male
groups showed no significant difference in mean self-confidence
scores. Over the 40 Ss, a nonsignificant correlation, r = +.14, was
found between the Ss’ self confidence scores and their actual
number of trials to mastery of the learning task.
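
The raw-score t values in Table 2 can be approximately recovered from the tabled means and sigmas alone. The sketch below is an illustration rather than the author's stated procedure: it assumes the sigmas are sample standard deviations (computed with N - 1) and uses the equal-n two-sample formula; the function name is hypothetical. Under those assumptions it gives about 0.02 for the male groups and about 2.19 for the female groups, in line with the table.

```python
import math


def t_from_summary(mean_a, sd_a, mean_b, sd_b, n):
    """Two-sample t for equal group sizes, from means and sample SDs.

    Assumption: sd_a and sd_b were computed with an (n - 1) divisor.
    """
    standard_error = math.sqrt((sd_a ** 2 + sd_b ** 2) / n)
    return abs(mean_a - mean_b) / standard_error


# Means and sigmas of the self-confidence scores from Table 2 (N = 10 each).
t_males = t_from_summary(19.3, 12.7, 19.4, 9.8, n=10)
t_females = t_from_summary(9.0, 5.1, 29.6, 29.3, n=10)

print(f"males:   t = {t_males:.2f}")
print(f"females: t = {t_females:.2f}")
```

The large difference between the two female sigmas (5.1 against 29.3) is what motivates the logarithmic transformation described in the table note.
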

If a High SD score meant that the S 
had a greater need to “try to look good,” 
this tendency was found only in the fe- 
males and not the males. In fact, through- 
out the entire study, the male groups as 
a whole did not differ from the female 
groups in the mean performance scores on 
any variable studied. Likewise, on the 
whole, the High SD Ss failed to differ 
significantly from the Low SD Ss in mean 
scores on any performance variable stud- 
ied. This problem will be reviewed further 
in the Discussion section. 

An analysis of variance (Table 3) based 
on the design for repeated measurements 
was made of the number of item presenta- 
tions in order to test the null hypothesis 
of no difference in the number of item 
presentations to mastery of the 16-item 
list. No significant difference in scores was 
found for the “between groups” variable. 
This was interpreted to mean that the four 
groups found the task to be essentially 
equal in difficulty. However, the “between 
item types” variable was found to be sig- 


TABLE 3
ANALYSIS OF VARIANCE OF NUMBER OF ITEM PRESENTATIONS
TO MASTERY OF 16-ITEM LIST

Source                                        df    Mean Square   F Test
Between groups (a)                             3       724.7       1.47
Between item types (b)                         3       411.6       5.90**
Groups X item types (b)                        9        26.7        -
Between Ss, same group                        36       493.8
  (Error term to test correlated
  scores, [a] above)
Interaction: Pooled Ss X item types          108        69.7
  (Error term to test uncorrelated
  scores, [b] above)

** Significant at the .01 point.
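
The F tests in Table 3 follow from dividing each effect by the error term named for it. The short sketch below checks that arithmetic; it assumes the unlabeled numeric column holds mean squares (the column heading did not survive in this copy), and the variable names are illustrative only.

```python
# Values as printed in Table 3; treating them as mean squares is an assumption.
ms_between_groups = 724.7
ms_between_item_types = 411.6
ms_groups_x_item_types = 26.7
ms_subjects_within_groups = 493.8   # error term for the correlated scores (a)
ms_subjects_x_item_types = 69.7     # error term for the uncorrelated scores (b)

# Each F ratio is the effect mean square over its designated error term.
f_groups = ms_between_groups / ms_subjects_within_groups
f_item_types = ms_between_item_types / ms_subjects_x_item_types
f_interaction = ms_groups_x_item_types / ms_subjects_x_item_types

print(f"groups: {f_groups:.2f}")
print(f"item types: {f_item_types:.2f}")
print(f"groups x item types: {f_interaction:.2f}")
```

The first two ratios agree with the tabled F values to within rounding, and the interaction ratio falls below 1, which is consistent with no F being reported for that source.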


nificant at the .01 point. Over all Ss, the 
High S-V items were mastered in sig- 
nificantly fewer presentations than the 
lower S-V items. The explanation was 
found in an evaluation of which item types, 
e.g., High S-V, the Ss attempted to learn 
at the beginning of the task, and which 
item types were left for the end. Since the 
items were randomized before each trial, 
the Ss could give preference to any item 
type. A tally of the item types mastered 
at the beginning of the learning task 
showed that the Ss gave priority to high 
S-V items. The actual ratio was 1.96:1.00 
favoring the High S-V items over the Low 
S-V items. The ratio for the last learned 
item types showed the reversed trend. The 
Low S-V items were mastered in the ratio 
of 1.76:1.00 over the High S-V items. It 
seemed clear that the Ss selected the 
High S-V items for learning early in the 
task. 

The actual mean trials to mastery of the 16-item list are shown in
Table 4. The mean trials were computed by dividing each S's total
number of item presentations by 16, the number of items. It can be seen
that for the two female groups, a significant difference in mean trials
to mastery was reached favoring the High SD females. The difference in
mean trials between the male groups failed even to approach
significance. The High SD Ss did not differ significantly from the Low
SD Ss in their mean trials to mastery, nor did the males differ from
the females. These values were excluded from the table.


TABLE 4
MEANS, SIGMAS, AND t TESTS FOR TRIALS TO MASTERY
(N = 10 per group)

Group               Mean    Sigma    t Test
High SD males       20.3    4.34
Low SD males        137     5.0      [illegible]
High SD females     16.9    4.15
Low SD females      21.7    4.50     2.49*

* Significant at the .05 level.


An analysis of variance based on the design for repeated measurements
was done on the scores for immediate recall (table not shown) to test
the null hypothesis of no difference in scores for immediate recall.
Neither of the main variables nor their interaction accounted for a
significant proportion of the variance. A similar analysis of variance
for the scores for delayed recall (table not shown) failed to show any
variable accounting for a significant proportion of the variance.
Therefore, for immediate and delayed recall, we are unable to reject
the null hypothesis of no difference in performance scores.

Table 5 shows the mean immediate recall scores for the four
experimental groups. It can be seen that when the intrasex mean scores
are compared, the males showed no significant difference in immediate
recall, while the difference in mean scores between the female groups
was significant at the .05 level, in favor of the High SD females.

Table 6 shows the mean delayed recall scores for the four experimental
groups.
It can be seen that the High SD females again significantly outclass
the Low SD females, while the males showed no significant difference in
mean delayed recall scores between their two groups. As before, no
significant difference was found between males and females, or between
the High SD Ss and the Low SD Ss.

In summary, the High SD females entered the learning task with a more
confident attitude than the Low SD females, learned the 16-item list in
significantly fewer trials, showed better immediate recall, and
maintained their superiority over a 24-hour delay. The two male groups
did not differ significantly from each other on any performance
variable. Though not shown in the tables, the males did not differ in
mean score from the females on any variable, nor did the High SD Ss
differ from the Low SD Ss on any performance variable.


Discussion 


Taylor and Spence (1952) originally considered the A scale as a measure
of drive level, and used it to predict behavior of Ss in accordance
with the Hullian formulation: ${}_{S}E_{R} = H \times D$; where
${}_{S}E_{R}$ was the reaction potential, H was the habit or learning
strength, and D was the measure of drive level. It can be seen that by
holding H constant, ${}_{S}E_{R}$ became a function of D, the drive
level. It could then be predicted that Ss with a high A score (high
drive level) would perform better than Ss with a low A score. Since the
high A score is equivalent to a Low SD score, due to the high negative
correlation between the two scales, we would expect Low SD Ss to do
better on our learning task than the High SD Ss. Our study showed that
the reverse of this prediction was found for the female groups and that
we could reach no conclusion with regard to the male groups. The
results are thus inconsistent with predictions based on the Taylor and
Spence formulation.


TABLE 5
MEANS, SIGMAS, AND t TESTS FOR IMMEDIATE RECALL
(N = 10 per group)

                    Mean No.
Group               of Items    Sigma    t Test
High SD males         11.1      1.91
Low SD males          11.6      2.63     0.49
High SD females       12.4      2.55
Low SD females         9.4      2.32     2.74*

* Significant at the .05 level.


TABLE 6
MEANS, SIGMAS, AND t TESTS FOR DELAYED RECALL
(N = 10 per group)

                    Mean No.
Group               of Items    Sigma    t Test
High SD males         10.2      2.53
Low SD males           9.4      3.13     0.63
High SD females       11.1      3.32
Low SD females         7.9      2.18     2.54*

* Significant at the .05 level.


The Edwards formulation was accurate in predicting the females'
performance, but failed to account for the males' performance. It is
possible that a clue to the explanation for this discrepancy occurs in
the suggestion of Heilizer, Axelrod, and Cowen (1954). They obtained a
significant second-order interaction “anxiety level X sex X
experimenter” which forced them to calculate analyses of variance
separately for each sex. They found that for the men alone, a
significant difference in means appeared between the three groups
tested by the three experimenters. Though this finding was not
pertinent to their study, they felt it was worthy of mention. It
suggested that such obvious variables as the sex of the S and the sex X
experimenter interactions may be important.

The “anxiety level X sex X experi- 
menter” interaction would very nicely ac- 
count for the findings of this study. Un- 
fortunately, the fact that only one male 
experimenter did all the testing rules out 
the possibility of evaluating this interaction.
If this interaction were in operation
during the learning procedure, then
high drive in the Edwards sense, measured 
by a High SD score, combined with the 
presence of an authority figure of the op- 
posite sex, would set up the necessary con- 
ditions for the operation of this factor, and 
account for the results we obtained. It 
could also be predicted that if a female 
authority figure did the testing, the re- 
sults would have reversed, with the High 
SD males outdoing the Low SD males, 
while the female groups would not differ 
between themselves to any significant de- 
gree. It would be of interest to repeat this 
type of study using male and female ex- 
perimenters, arranging the experimental 
design so as to evaluate the interaction of 
drive level X Ss’ sex X experimenter.

Another possibility that could be posited 
to explain the results is that the SD scale 
really measured the self confidence or level 
of aspiration of the Ss, and that this 
underlying factor was related to the learn- 
ing performance and accounted for the 
significantly superior learning and recall 
of the High SD females. Two arguments 
oppose this interpretation. If it were true, 
then it would follow that the High SD 
males would be similarly motivated and 
perform better than the Low SD males. 
This was not the case since no significant 
difference in performance was found be- 
tween the two male groups. Secondly, a 
nonsignificant correlation between the self- 
confidence score and actual trials to mas- 
tery was obtained, indicating this factor 
could not account for a significant amount 
of the variance obtained in the learning
scores.

The finding that the High S-V items were learned in fewer trials than
the Low S-V items may be explained on the basis of faulty experimental
methodology. Three uncontrolled sources of error could be posited to
account for the results. First, the E read the nonsense syllables aloud
during the learning procedure. He could have given subtle vocal
emphasis to nonsense syllables connected with the High S-V items, and
less vocal emphasis to the Low S-V items. Second, the items were
recalled aloud by the S, and it was possible that the E allowed
slightly more time for recall of the High S-V items than for the Low
S-V items. The E used a stop watch to time his intervals, which is far
from satisfactory when equal time intervals are desired. Related to
this same problem was the fact that the statements varied in length and
therefore required different lengths of time merely to say them aloud.
Thirdly, it was possible that by use of expressive mannerisms of which
he was unaware, the E may have communicated the idea to the S that he
favored the High S-V items, thereby encouraging the Ss to greater
effort toward learning the High S-V items.

Correction of these sources of error is possible. Statements of equal
length could be developed. A memory drum to present each item for an
equal interval of time could be used to eliminate any tendency to favor
one item type over another. Also, an auditory signal could be used to
time the five-second interval of recall. These mechanical conveniences
would leave the E free to attend to the recording of data and any other
duties involved in the experimental procedure. Lastly, the E could sit
out of the line of vision of the S but still be able to observe his
behavior.


SUMMARY AND CONCLUSIONS


This study examined the relationship between social desirability as a
psychological variable and learning performance and recall of complex
verbal material, using college students as subjects.

It was found that social desirability, as a property of the items,
accounted to a significant degree for the superiority of the High S-V
items over the Low S-V items in learning, but not in recall.

The social desirability variable, as a 
property of the person, was found to dis- 
criminate to a significant degree, good 
from poor performers in the female college 
population, though it failed to separate the 
male students on the basis of their learn- 
ing performance. 

Explanations of the findings were offered 
and alternative interpretations of the re- 
sults discussed. 
RELATIONSHIPS AMONG SEVERAL MEASURES OF
PUPIL ADJUSTMENT*


IRA J. SEMLER 
University of Texas 


Currently the employment of paper-and- 
pencil personality, adjustment, and rating 
scales with school children is widespread
and increasing. While sophisticated investi- 
gators and test constructors recognize no 
one-to-one relationship between test results 
and actual classroom adjustment, there ex- 
ists a less critical attitude on the part of 
many educators. Frequently naive inter- 
pretations of group test results lead to 
gross discrepancies in judging an individual 
pupil’s adjustment. A common error is 
failure to recognize that adjustment is a 
function of many variables and that differ- 
ent tests may measure different aspects of 
adjustment. An unwarranted assumption 
is often made that pupils who are classified 
at one level of adjustment by one test are 
similarly classified by other types of meas- 
ures. While several investigations have at- 
tempted to determine relationships be- 
tween certain adjustment measures and 
pupil characteristics (Bedoian, 1953; Bon- 
ney, 1943; Grossmann & Wrighter, 1948; 
Scandrette, 1953; Sewell & Haller, 1956), 
Tindall (1955) has pointed out that few 
studies have used a variety of approaches 
to the measurement of adjustment. An in- 
vestigation which demonstrates patterns of 
interrelationships between certain adjust- 
ment instruments and measurement tech- 
nics would also provide information for 
more effective interpretation of test results. 

The purpose of this investigation was to 
determine patterns of interrelationships 
among three separate approaches to the 
measurement of adjustment; namely, peer 
acceptance, self-report, and authority figure (teacher) ratings.
Additionally, sex differences in relationship patterns and generality
of relationships across classrooms were studied.


* Based on a portion of the data gathered
for a dissertation at the State University of
Iowa.


METHOD 


Subjects 


The Ss were 483 children (247 boys and 236 girls)
who were enrolled in 18 fifth-
grade classrooms in the public elementary 
school system of a Midwestern city of 
about 80,000 population. 


Measures 


Two measures of peer acceptance were 
employed. The first instrument used a so- 
ciometric (SOC) procedure patterned after 
that described by Jennings (1948). The 
second was the Ohio Social Acceptance 
Scale (OSA). This scale provides every
S an opportunity to rate every other S in 
his group on a friendship scale which ranges 
from 1 (My very, very best friends) to 6 
(Dislike them). 


Self-descriptions (self-report) were total scores obtained from the
California Test of Personality (CTP), Elementary Series, Form AA
(California Test Bureau, 1953).


Teacher ratings (TRt) of pupil adjust- 
ment were obtained on Items I, II, VI, VII, 
and XI of the Rating Scale For Pupil Ad- 
justment (Science Research Associates, 
1953). Ratings from the first four items 
were pooled to provide an adjustment 
score, while Item XI was used as a “school 
conduct” rating. 

In addition to the foregoing measures of 
adjustment, data regarding intelligence, 
achievement, and parental occupation were 
obtained from the child’s cumulative record. Intelligence quotients
were based on the Otis Quick Scoring Mental Ability Tests; the median
grade equivalent earned on the Stanford Achievement Test was used as a
general indicator of achievement level. Parental occupation was rated,
using a 21-point modification of the Warner Index of Status
Characteristics Occupation Scale.


*Ohio State Department of Education,
The Ohio Scholarship Tests (undated).


Procedure 


Teacher ratings from the SRA Rating 
Scale For Pupil Adjustment were secured 
along with cumulative record data before 
classroom testing in order to prevent rater 
contamination from knowledge of other 
adjustment measure results. 

A planned schedule for two testing ses- 
sions one week apart enabled classroom 
testing for all 18 classrooms to be com- 
pleted within a two-week period. Absentees 
were tested within three days after the 
completion of classroom testing. The same 
procedure was followed in all classrooms. 
The first session of 35 min. was used to ad- 
minister the SOC and to start the CTP; 
the second session of 35 min. was used to 
administer the OSA and to complete the 
CTP. This was adequate time for all Ss to 
complete the two peer acceptance instru- 
ments and the self-report. 


RESULTS AND DISCUSSION


The major results of this investigation are summarized in Table 1. The
pattern of obtained correlations was similar for girls, boys, and
combined sexes and reflected a positive and reliable relationship
between each of the pupil adjustment measures. The results in Table 1
indicate that the various instruments and measurement technics do not
classify pupil adjustment in the same way even though the several
approaches appeared to contain a common element which could be labelled
“adjustment.” The two peer acceptance instruments were highly related,
which is not surprising since they are operationally similar. The next
highest relationships were obtained between the two peer acceptance
measures and the authority figure ratings, which suggests a common
“social reaction” and an awareness on the part of teachers that social
acceptability is an important aspect of adjustment. The lowest
correlations were obtained between the “social” (SOC, OSA, and TRt)
approaches and the self-report. Since all correlations are reliably
different from zero, the conclusion is that pupils who were more
acceptable socially were rated as better adjusted by their teachers and
rated themselves higher (well adjusted) on a self-report.


TABLE 1
RELATIONSHIPS AMONG SEVERAL MEASURES OF PUPIL ADJUSTMENT:
INTERCORRELATIONS BY SEX FOR GROUPS

              SOC     OSA     TRt
OSA
  Girls       .67
  Boys        .70
  Total       .68
TRt
  Girls       .49     .54
  Boys        .40     .53
  Total       .45     .54
CTP
  Girls       .30     .35
  Boys        .27     .37     .27*
  Total       .37     .37

Note. SOC is sociometric; OSA is Ohio Social Acceptance Scale; TRt is
Rating Scale For Pupil Adjustment; CTP is California Test of
Personality. Top value is correlation based on all girls. Middle value
is correlation based on all boys. Bottom value is correlation based on
sexes combined. All correlations are reliably different from zero at
the .01 level of confidence.

* In this one instance the correlations for the two sexes differ at the
.05 level of confidence.

The pattern of relationships was similar for girls and boys, with the
exception of a significantly higher relationship between teacher
ratings of girls’ adjustment and girls’ self-reports (CTP) than between
teacher ratings of boys’ adjustment and the boys’ self-reports. Since
15 of the 18 teachers were women, a further separation of relationship
data (median correlations) seemed worthwhile and is presented in Table
2 according to sex of teachers as well as sex of pupils. The
significant sex difference in correlations noted above is difficult to
explain since correlation patterns involving teacher ratings of same
and opposite sex pupils’ adjustment and the pupils’ score on a
self-report were quite alike. However, Table 2 clearly shows that
relationships involving teacher ratings and sociometric status differed
depending upon the sex of the teacher and the sex of the pupil. The
relationship between adjustment ratings (TRt) and peer acceptance (SOC)
was higher for girls than for boys when female teachers were rating,
and the reverse was true when male teachers rated adjustment. All
correlations involving teacher ratings of same sex pupils’ adjustment
were reliably different from zero at the .05 level of confidence,
whereas only one correlation involving teacher ratings of opposite sex
pupils’ adjustment was significantly different from zero.


TABLE 2
MEDIAN CORRELATIONS BETWEEN TEACHER RATINGS OF ADJUSTMENT AND OTHER
MEASURES OF ADJUSTMENT ACCORDING TO SEX OF TEACHER AND SEX OF PUPIL

                                  SOC      OSA      CTP
Female teacher ratings of
adjustment (TRt) (N = 15)
  Girls                           .55**    .66**    .45*
  Boys                            .38      .65**    .34
Male teacher ratings of
adjustment (TRt) (N = 3)
  Girls                           .35      .39      .42
  Boys                            .68**    .68**    .46*

Note. Significance levels based on minimum class size of N = 21.

* Reliably different from zero at the .05 level of confidence.

** Reliably different from zero at the .01 level of confidence.
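
The significance marks attached to the Table 2 correlations, which the table note bases on a minimum class size of N = 21, can be checked with the usual t test for a correlation coefficient. The sketch below is illustrative and is not taken from the article: it assumes a two-tailed test with N - 2 = 19 degrees of freedom and hard-codes the standard critical t values for that case. The names `t_for_r` and `stars` are hypothetical.

```python
import math

# Two-tailed critical values of t for 19 degrees of freedom (N = 21).
T_CRITICAL_DF19 = {0.05: 2.093, 0.01: 2.861}


def t_for_r(r, n):
    """t statistic for testing whether a correlation r differs from zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


def stars(r, n=21):
    """Return '**', '*', or '' for the .01 level, the .05 level, or neither.

    Only n = 21 is supported, because the critical values above are
    specific to 19 degrees of freedom.
    """
    if n != 21:
        raise ValueError("critical values are tabulated for n = 21 only")
    t = abs(t_for_r(r, n))
    if t >= T_CRITICAL_DF19[0.01]:
        return "**"
    if t >= T_CRITICAL_DF19[0.05]:
        return "*"
    return ""


for r in (0.55, 0.45, 0.42, 0.38):
    print(r, stars(r) or "n.s.")
```

Under these assumptions a correlation of about .44 is needed for the .05 level and about .55 for the .01 level, which matches the pattern of marks in the table.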

It is reasonable to suppose that each classroom was a unique social
entity and that the total correlations reported in Table 1 did not
reflect accurately the relationships for each classroom. Table 3
summarizes the results of the analyses of covariance which tested the
homogeneity of the 18 classroom regression coefficients for all
combinations of pupil adjustment measures. The analyses support the use
of general regression lines for combinations of the sociometric with
teacher ratings (SOC-TRt) and with the self-report (SOC-CTP).
Significant F tests for all other pupil adjustment combinations
indicated heterogeneity of classroom regression data. It follows that
separate relationship data specific to the classroom should be used
when combinations of these adjustment measures are considered.

Table 4 was prepared to summarize certain individual classroom
statistics. It is quite noticeable that the median correlation was a
poor indicator of average relationship, since the range of classroom
correlations was quite wide in every case. This wide range of
correlations involving teacher ratings might be accounted for partially
by individual teacher difference in judging adjustment. For instance,
correlations between teacher ratings of adjustment and teacher ratings
of school conduct ranged from .21 to .87 (not tabled), which indicated
that some teachers tended to equate adjustment with “school conduct.”
Of course, the low correlations within this range might mean that some
teachers recognized that well behaved pupils are not always well
adjusted and that a little nonconformity is not necessarily symptomatic
of poor adjustment.


TABLE 3
RESULTS FROM SUMMARIES OF ANALYSES OF COVARIANCE FOR VARIOUS
COMBINATIONS OF PUPIL ADJUSTMENT MEASURES: F TESTS OF HOMOGENEITY
OF 18 CLASSROOM REGRESSION COEFFICIENTS

          SOC        OSA        TRt
OSA      14.94**
TRt       1.14       5.56**
CTP        .77       4.60       2.97

* Significant at less than the .05 level of confidence.

** Significant at less than the .01 level of confidence.

TABLE 4
MEDIAN CORRELATION AND RANGE OF CORRELATIONS FOR 18 CLASSROOMS

              SOC                   OSA           TRt
OSA
  Median      [illegible]
  Range       [illegible] to .84
  No. Sig.    (18)
TRt
  Median      .50*                  .62**
  Range       .23 to .65            .25 to .84
  No. Sig.    (15)                  (14)
CTP
  Median      .30                   .42           .39
  Range       .05 to .49            .19 to .63    -.07 to .68
  No. Sig.    (4)                   (11)          (10)

Note. Top value is median correlation of 18 classrooms. Middle values
are extremes of correlations. Bottom value, in parenthesis, is the
number of correlations in the range which were reliably different from
zero at the .05 level of confidence.

* Reliably different from zero at the .05 level of confidence for
N = 21.

** Reliably different from zero at the .01 level of confidence for
N = 21.


Despite the wide range of relationship data across classrooms,
inspection of the 18 intercorrelation matrices revealed strikingly
similar patterns which followed the pattern of median correlations in
Table 4. In almost every classroom the correlations within the “social
judgment” approaches were higher than between these approaches and the
self-report. The generality and applicability of this finding should be
explored further.

Table 5 summarizes the relationships between the various measures of
pupil adjustment and pupil characteristics used in this study. Little
difference was noted between median and total correlations.


TABLE 5
SUMMARY OF RELATIONSHIPS BETWEEN PUPIL ADJUSTMENT MEASURES AND
ABILITY, SCHOOL ACHIEVEMENT, AND PARENTAL OCCUPATION

                                SOC      OSA      TRt      CTP
Intelligence (Otis)
  Median                        .29      .35      .34      .36
  Total                         .33**    .33**    .33**
Achievement (Stanford)
  Median                        .22      .39      .48*     .38
  Total                         .22**    .34**    .40**    .40**
Social class (Parental
occupation)
  Median                        .2[?]    .28      .20      .14
  Total                         .22**    .23**    .26**    .20**

Note. Median refers to median correlation of 18 classrooms. Total
refers to correlation of total groups without regard to classroom.

* Reliably different from zero at the .05 level of confidence.

** Reliably different from zero at the .01 level of confidence.


While the linear correlations reported in Table 5 appeared to show that
pupil characteristics were poor “predictors” of pupil adjustment, it is
possible that relationships between some of these measures might be
described better by curvilinear correlations. For example, a
scatterplot of results from the Ohio Social Acceptance Scale versus
median grade equivalents from the Stanford Achievement Test revealed a
curvilinear trend. The computed eta for this scatterplot was .71 in
comparison with obtained linear correlations of .34 (total) and .39
(median). The trend indicated that underachievers were not as socially
acceptable as achievers or overachievers. Inspection of the scatterplot
also showed greater variation in peer status for overachievers than for
underachievers. The prediction of curvilinear trends between measures
of intelligence or achievement and other measures of pupil adjustment
seems reasonable and might well apply to other elementary school
populations.
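
The contrast between eta and the linear correlation can be illustrated with a small computation. The sketch below uses invented scores, not the study's data, and assumes the conventional definition of the correlation ratio (square root of between-group over total sum of squares, with one variable treated as ordered categories). The function names and the `achievement` and `acceptance` lists are hypothetical.

```python
import math
from statistics import mean


def pearson_r(xs, ys):
    """Product-moment (linear) correlation between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def correlation_ratio(xs, ys):
    """Eta of y on x, treating each distinct x value as a category."""
    grand_mean = mean(ys)
    groups = {}
    for x, y in zip(xs, ys):
        groups.setdefault(x, []).append(y)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values())
    ss_total = sum((y - grand_mean) ** 2 for y in ys)
    return math.sqrt(ss_between / ss_total)


# Invented data: achievement level (binned grade equivalent) and a peer
# acceptance score. Acceptance rises steeply at first, then levels off.
achievement = [3, 3, 3, 4, 4, 4, 5, 5, 5, 6, 6, 6]
acceptance = [2.0, 2.5, 3.0, 4.0, 4.5, 5.0, 4.5, 5.0, 5.5, 4.0, 5.0, 6.0]

r = pearson_r(achievement, acceptance)
eta = correlation_ratio(achievement, acceptance)
print(f"linear r = {r:.2f}, eta = {eta:.2f}")
```

Eta can never be smaller than the absolute value of r computed on the same grouped data, so a large gap between the two, as between .71 and .34 in the text, points to a relationship that a straight line describes poorly.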

The effective use of group test results re- 
quires that interpreters be aware not only 
of extent of relationship among different 
approaches to the measurement of adjust- 
ment but, also, that such relationships may 
be relatively unique to individual class- 
rooms. In addition, the interpreter should 
give especial consideration to teacher rat- 
ings, since sex of teacher with respect to 
sex of pupil may be an important factor
when assessing the individual pupil’s ad- 
justment. 


SUMMARY


This investigation obtained measures of 
relationship among several approaches to 
the measurement of pupil adjustment. Cu- 
mulative record information (intelligence, 
achievement, and parental occupation) 
plus teacher ratings of pupil adjustment 
for 483 fifth-grade children in 18 class- 
rooms were obtained in advance of a care- 
fully planned classroom testing schedule. 
The classroom testing schedule obtained 
successive measurements of peer accept- 
ance (SOC and OSA) and pupil responses 
on a self-report test (CTP). 

The findings in brief were: 

1. A reliable and positive but moderate 
set of intercorrelations among the several 
pupil adjustment measures was obtained. 

2. Significant sex differences were evi- 
dent in correlations involving teacher rat- 
ings and self-reports. 

3. Sex differences in relationship pat- 
terns depended upon sex of teacher when 
teacher ratings and sociometric status cor- 
relations were considered. 

4. General regression data did not de- 
scribe individual classroom relationships. 

5. One peer acceptance instrument 
(OSA) appeared to be related to certain 
pupil characteristics in a curvilinear fashion.
THE EFFECT OF TEACHER-PUPIL CONTACTS
INVOLVING PRAISE ON THE SOCIOMETRIC
CHOICES OF STUDENTS*


NED A. FLANDERS AND SULO HAVUMAKI


University of Minnesota 


Many teachers have used sociometric 
data to assign a choice value to any indi- 
vidual in a group. The individual who re- 
ceives the most choices has the highest 
choice value, and the rest of the group can 
be arranged in rank order. Very little has 
been said about how teachers can influence 
the choice value of a student. 
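
As a concrete illustration of the choice value described above, the sketch below tallies the choices each group member receives and arranges the group in rank order. The names, the nominations, and the function `choice_values` are invented for the example; ties are broken alphabetically, which is an arbitrary assumption.

```python
from collections import Counter


def choice_values(choices, members):
    """Rank members by number of choices received, highest first.

    `choices` maps each chooser to the list of members he or she chose.
    Ties are broken alphabetically (an arbitrary convention).
    """
    received = Counter(chosen for picks in choices.values() for chosen in picks)
    return sorted(((m, received[m]) for m in members), key=lambda pair: (-pair[1], pair[0]))


# Hypothetical nominations: each student names two others.
choices = {
    "Ann": ["Bob", "Cy"],
    "Bob": ["Cy", "Dee"],
    "Cy": ["Bob", "Ann"],
    "Dee": ["Cy", "Bob"],
}

ranking = choice_values(choices, members=list(choices))
for name, value in ranking:
    print(name, value)
```

In this invented group Bob and Cy each receive three choices and so share the highest choice value, while Ann and Dee receive one each.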

The present study is an attempt to see 
if teacher-pupil contacts involving praise 
will affect the sociometric choices received 
later on. The hypothesis to be tested is 
that contacts involving praise, given by a 
prestige figure, will increase the choice 
value of the student. 

The following conditions must be con- 
trolled to test this hypothesis: 

1. The group situation must involve an 
attractive activity and the praise given 
by the prestige figure must be centered on 
the relationship between an individual’s
behavior and the major activity of the
group; merely to talk about the individual
per se would not be a fair test.

2. Information given to the group must 
be controlled by the teacher so that cer- 
tain individuals can be singled out for at- 
tention. 

3. Any acquaintanceship patterns exist- 
ing in the group prior to the experimental 
discussions must be identified so that it 
can be demonstrated that the factor of 
personal acquaintance did not bias the results.


4. The criterion question used to elicit
sociometric choices must be related to the
major activity of the group and appear to
have consequences in the future activities
of the group. This eliminates such questions
as, “Who contributed the most to
this discussion?”

5. The prestige figure controlling the
flow of information must be liked by the
group, or at least not disliked.


*The authors wish to express their appreciation
to the Minneapolis and St. Paul
school officials and students whose cooperation
was essential to the success of the
study. This research was carried out as part
of the research program of the Laboratory
for Research in Social Relations, University
of Minnesota.


PROCEDURE 


In a study conducted at the University 
of Minnesota, Laboratory for Research in 
Social Relations, 33 groups of 10 subjects 
each (hereafter referred to as Ss) were 
involved in an experiment in which socio- 
metric choices were made. In all groups 
the Ss consisted of approximately an equal 
number of boys and girls from tenth-grade 
classes of Minneapolis and St. Paul high 
schools. It was the belief of these groups 
that they were to appear in a Quiz Kid 
contest between schools from Minneapolis 
and St. Paul. Each group was to decide 
whether their group would appear on a 
radio or TV quiz program, either alterna- 
tive being possible. This discussion was 
led by a teacher-trainer, not known to 
any of the students, whose function it was 
to help the students reach a decision and 
then train them for the performance. 
Elaborate arrangements were made to im- 
press all Ss that the Quiz Kid contest was 
an attractive and exciting experience. 
Practically all Ss approached the problem 
of choosing between radio and TV pro- 
grams seriously. At the very end of each 
experimental session when all Ss learned 
there would be no contest, sincere expressions
of disappointment were displayed
in each group. Judging from the
manner in which all sessions were con- 
ducted and the behavior of the Ss, there is 
no doubt in the minds of the authors that 
the prospect of participating in the contest 
was attractive and stimulating to the Ss. 
Thus, these group discussions satisfacto- 
rily meet the first half of Condition 1 men- 
tioned previously. 

Besides the attractiveness of the group’s 
activity, Conditions 1 and 2 refer to the 
nature of the teacher-trainer and the 
method of giving this information. The 
teacher-trainer’s behavior for 17 groups 
was individually oriented, that is, he called 
on certain individuals by name and 
praised their contributions to the dis- 
cussion. The teacher behavior thus served 
to provide praise for certain individuals 
in the group by focusing attention on these 
individuals in the individually oriented 
condition. The individuals attended to in 
this manner were always those who hap- 
pened to sit in the odd numbered seats 
around a U shaped table. The original 
choice of seats was up to the Ss. The procedure
of calling only on Ss seated in the
odd numbered seats effectively restricted
all verbal communication to these Ss and
the teacher-trainer. While many Ss seated
in the even numbered seats raised their
hands indicating a desire to speak, they
were not called on. Thus the teacher-trainer’s
praise reactions were clearly
centered on Ss who were easily identified
during the individual approach.

In the remaining 16 groups the teacher-trainer
behaved in a group oriented fashion
and praised the group as a whole for
the ideas contributed by individual members.
He called on no individuals by name
and anyone who wished to talk was allowed
to do so.

The verbal behavior of the teacher-trainer
was categorized from voice recordings
of all the group discussions. These
categories consisted of a simple analysis
of praise statements made by the teacher-trainer
in response to student contributions.
In the individually oriented condition
most praise statements were made
with reference to specific individuals; in
the group oriented condition the praise
statements were never coupled with an
individual’s name and were interpreted
in terms of the group whenever this could
reasonably be done.

Praise statements became standardized 
during pilot runs prior to conducting the 
discussions reported here. Examples of 
statements used in the individual ap- 
proach are: “I’m glad John mentioned 
that point because it is very important for 
this group to consider in order to make a 
good decision”; “That’s a very good sug- 
gestion, Mary”; “That’s very true. I think 
Jim will be a big help to this team.” 
During a group approach it was not always
easy to turn a praise statement away
from the individual toward the group,
for example: “That’s an important point. 
This team has good ideas”; “Good! A 
second good point; can anyone [looking 
at the group] add any more?” Occasion- 
ally, during the group approach only, the 
teacher-trainer would be interrupted after 
saying, “That’s a good idea...” before 
praise could be turned toward the group. 
Thus, some praise statements during the
group approach were classified as individual
praise statements. An example of
a praise statement that is clearly group 
oriented is: “This group is going to make 
a fine team.” 

The comparison of the praise state- 
ments of the teacher-trainer’s individual 
and group oriented behavior patterns 
shown in Table 1 indicates that neither 
pattern is 100% consistent. Actually, the 
differences are greater than indicated in 
Table 1. For example, the data do not 
reveal such important aspects of the 
teacher-trainer’s behavior as looking di- 
rectly at an individual, calling on him by 
name, and praising his ideas in association 
with his name, as contrasted with looking 
generally at the whole group, not calling 
on any one individual, and never associating
the name of an individual with
praise. The two patterns of teacher be- 
havior were clearly different to an ob- 
server and easily identified by the tape 
recording with 100% accuracy. 

Thus far the discussion has dealt with 
the first two conditions necessary to the 
study; next, Conditions 3 and 4 will be 
considered. After each group was seated, 
they were first asked to list the names of 
all the Ss with whom they were acquainted 
and, in addition, place an “X” beside the 
name of any person who was a good friend. 
For all experimental groups the indication 
of friendship was so infrequent (a total of 
six friendship pairs) that they are not 
reported here. This was undoubtedly due 
to the cooperation of school authorities 
who were asked to choose Ss for each 
group from separate homerooms. The pre- 
discussion acquaintanceship data show 
that 477 Ss were acquainted with the 85 
Ss who sat in the odd numbered seats be- 
fore the discussion started; similarly, 
505 Ss were acquainted with those Ss who 
sat in the even numbered seats. Of the 
total (928) acquaintanceships reported, 
48.6% involved Ss in the odd numbered 
seats, 51.4% involved Ss in the even num- 
bered seats. The hypothesis that this dif- 
ference is significant can be tested by chi 
square. Since chi square = 0.012, the dif- 
ference is clearly not significant. In ad- 
dition, the Ss seated in the even num- 
bered seats are slightly better known, so 
that a more rigorous test of the main hy- 
pothesis is possible. 

After each discussion the teacher-trainer 
collected sociometric information in re- 
sponse to the statement, “List five mem- 
bers of the group whom you think would 
be good program participants.” It was 
explained that two teams, size 10 each, 
would result in 20 Ss being in the studio 
ready to answer the quiz questions, and 
this was far too many. Therefore, each 
team would have five program partici- 
pants who would answer the questions 
and five helpers who could help the participants.
The Ss were told that their own
group choices would be used to identify 
the program participants. The results of 
their selection are shown in Table 2. 


RESULTS


TABLE 1

DISTRIBUTION OF TEACHER PRAISE STATEMENTS

                           Individual Condition     Group Condition
                               (17 groups)           (14 groups)*
                               N        %             N        %
Praise to Individuals         114      79.1           28      30.1
Praise to Group                30      20.9           65      69.9
Totals                        144     100.0           93     100.0

* Two groups lost as recordings were inadvertently erased.


TABLE 2

DISTRIBUTION OF CHOICES FOR PROGRAM PARTICIPANTS

                           Individual Treatment     Group Treatment
                               (17 groups)            (16 groups)
                               N        %             N        %
To Odd Seats                  496      59.3          384      48.6
To Even Seats                 341      40.7          406      51.4
Totals                        837*    100.0          790**   100.0

Note.—Since some Ss made only four choices, * does
not equal 5 × 170 = 850 and ** does not equal 5 × 160 =
800.


TABLE 3

DESIRE TO CONTINUE WITH SAME TEACHER-TRAINER

                           Individual Treatment     Group Treatment
                               N        %             N        %
Yes                           141      82.9          136      85.5
Not Sure                       27      15.9           20      12.6
No                              2       1.2            3       1.9
Totals                        170     100.0          159*    100.0

* One S failed to answer this question.


The data in Table 2 follow the prediction
that Ss seated in the odd numbered
seats in the individual treatment and given
praise by the teacher-trainer would re- 
ceive more choices than would occur by 
chance, and the same comparison in the 
group treatment would show no such 
trend. The significance of 59.3% distri- 
bution compared with a hypothetical 50- 
50 split can be calculated by using the 
standard error of correlated proportions. 
Testing the significance of a 59.3-40.7 
split by this method yields a critical ratio 
of 2.98, indicating a 0.0028 probability 
that this difference could have occurred by 
chance. If one ignores the probable inter- 
dependence of the choice data and uses a 
chi square test on the data in Table 2, 
chi square = 19.063, indicating a proba- 
bility of 0.001 for one degree of freedom. 
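
The step from a test statistic to a probability in the paragraph above can be sketched as follows. This is a minimal illustration, assuming the critical ratio is referred to the normal curve with a two-tailed probability and the chi square to one degree of freedom; the function names are hypothetical, and the statistics themselves (2.98 and 19.063) are taken as reported, not recomputed from Table 2.

```python
from math import erfc, sqrt


def two_tailed_p_from_critical_ratio(z):
    """Two-tailed normal-curve probability for a critical ratio z."""
    return erfc(abs(z) / sqrt(2))


def chi_square_p_one_df(x):
    """Upper-tail probability of chi square with one degree of freedom.

    With one degree of freedom, chi square is the square of a standard
    normal deviate, so P(chi2 > x) = P(|Z| > sqrt(x)).
    """
    return erfc(sqrt(x / 2))


# Statistics as reported in the text (assumed correct as printed).
p_ratio = two_tailed_p_from_critical_ratio(2.98)
p_chi = chi_square_p_one_df(19.063)

print(f"critical ratio 2.98  -> two-tailed p = {p_ratio:.4f}")
print(f"chi square 19.063    -> p = {p_chi:.6f}")
```

Under these assumptions the first probability comes out close to the 0.0028 quoted in the text, and the second falls below the 0.001 level.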

The fifth condition listed at the begin- 
ning of this article refers to the require- 
ment that the Ss like or at least not 
dislike the teacher-trainer. After the dis- 
cussions, the teacher-trainer would leave, 
and the students were informed by the 
host-experimenter that it was not neces- 
sary for them to continue with the teacher- 
trainer who had led the discussions unless 
they wanted to. They were instructed to 
indicate their feelings toward the teacher- 
trainer by checking the item, “Would you 
like to continue working with the same 
teacher-trainer?” 

Table 3 indicates that there is little 
difference between the two conditions of 
teacher behavior as to the degree of liking 
or of disliking of the teacher-trainer. The
data for 329 Ss are shown in Table 3. 

The hypothesis that the distribution of
answers for the individual treatment is
not significantly different from the distribution
of answers in the group treatment
can be tested by chi square. A chi
square of 0.966 for two degrees of freedom
fails to reject this hypothesis.
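
The chi square for Table 3 can be checked with a short computation. The sketch below assumes the ordinary Pearson statistic for a contingency table, with expected counts taken from the row and column totals; the function name is hypothetical, and the counts are those printed in Table 3.

```python
from math import exp


def pearson_chi_square(table):
    """Pearson chi square and degrees of freedom for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# Rows: Yes, Not Sure, No.  Columns: individual treatment, group treatment.
table3 = [
    [141, 136],
    [27, 20],
    [2, 3],
]

chi2, df = pearson_chi_square(table3)
# For two degrees of freedom the upper-tail probability is exp(-x / 2).
p_value = exp(-chi2 / 2)
print(f"chi square = {chi2:.3f}, df = {df}, p = {p_value:.2f}")
```

Computed this way, the statistic agrees with the reported 0.966 for two degrees of freedom. The two cells in the "No" row have small expected counts, so the probability should be read as approximate.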


SUMMARY


In a decision making experiment sociometric
data were collected from 330 tenth-grade
students after they had interacted
with a teacher who praised their participation.
In 17 10-man groups the teacher
interacted only with Ss seated in the odd 
numbered seats. In another 16 groups, all 
Ss were allowed to talk and statements 
involving praise were directed to the group 
as a whole. In the former situation the Ss 
in the odd numbered seats received sig- 
nificantly more choices than did the Ss 
seated in the even numbered seats. In the 
latter situation, the difference between Ss 
in the odd and even numbered seats was 
not significant. 

There are undoubtedly many different 
ways to increase the choice value of stu- 
dents in a classroom. The results of this 
experiment indicate that teacher-pupil in- 
teraction involving praise that is suppor- 
tive and constructive is likely to increase 
the choice value of a student indicating 
greater acceptance by his peers. 


(Received August 28, 1959) 
THE JOPLIN PLAN OF READING VS. A
TRADITIONAL METHOD 


ELMER F. MORGAN, JR. 
Monroe County Schools 


AND GERALD R. STUCKER
Dundee Elementary Schools 


The research described here was designed
to determine whether the Joplin
plan of reading (Floyd, 1954) would produce
significantly superior results where
teachers have been randomly assigned to
experimental and control groups, and
where sex, method, IQ, and initial reading
ability have been carefully controlled.

Because a rather explicit approximation of
the Joplin plan is described under Procedure,
the plan is not described
here. It should be noted, however,
that since Floyd’s report the Joplin plan
has been carefully reviewed and adapted
by a substantial number of school systems.
The plan has achieved national
recognition through its popularization in
a 1957 issue of the Saturday Evening Post.
While many of the school systems using
this plan claim superior results at the
middle grades, there appears to be no published
statistical research to validate these
claims.


SUBJECTS AND PROCEDURE 


Table 1 presents the sex, IQ, and initial 
reading characteristics of 90 matched pairs 
of fifth- and sixth-grade Ss of a rural con- 
solidated school located in a southeastern 
Michigan county. The Ss of Table 1 were 
those Ss out of a total population of 226 
fifth and sixth graders who could be successfully
matched on two separate measures
of initial reading ability. The two
measures of reading ability were the Durrell-Sullivan,
Form A, and California, 1957
W Series, reading achievement tests.

At each grade the Ss were divided into
fast and slow experimental and control
groups by the following procedure. Those
Ss scoring above the expected grade norms 
of 4.8 and 5.8 for next year’s fifth and 
sixth grades (groups were formed at the 
end of the 4th and 5th grades) were desig- 
nated fast achievers. Those Ss scoring be- 
low this cutting line were designated slow 
achievers. Ss at the fast levels were 
matched on the two separate reading 
measures and were then randomly as- 
signed to either a control or experimental 
group. The procedure was repeated at the 
slow levels. This procedure resulted in 
matched experimental and control groups 
of two levels at each separate grade. 
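
The grouping procedure just described can be sketched in code. This is only an illustration under stated assumptions: the text does not say which measure was compared with the grade norm (the Durrell-Sullivan score is assumed here), how a score exactly at the norm was treated (counted as fast here), or how close two Ss had to be to count as matched (neighbors in sorted order are paired here, and an unpaired S is dropped). The function name, pupil labels, and scores are all hypothetical.

```python
import random


def assign_matched_pairs(scores, norm, seed=0):
    """Split Ss into fast and slow levels, pair them, and randomize each pair.

    scores: dict mapping a pupil label to (durrell, california) grade scores.
    norm:   expected grade norm used as the cutting line (e.g., 4.8 or 5.8).
    """
    rng = random.Random(seed)
    levels = {
        "fast": [s for s, (durrell, _) in scores.items() if durrell >= norm],
        "slow": [s for s, (durrell, _) in scores.items() if durrell < norm],
    }
    groups = {}
    for level, members in levels.items():
        # Sort on both measures so that neighbors have similar scores.
        ordered = sorted(members, key=lambda s: scores[s])
        experimental, control = [], []
        # Pair neighbors; zip() silently drops an unpaired last pupil.
        for first, second in zip(ordered[0::2], ordered[1::2]):
            pair = [first, second]
            rng.shuffle(pair)  # the "coin flip" within each matched pair
            experimental.append(pair[0])
            control.append(pair[1])
        groups[level] = {"experimental": experimental, "control": control}
    return groups


# Hypothetical pupils entering fifth grade (norm 4.8).
example_scores = {
    "a": (5.2, 5.4), "b": (5.3, 5.5), "e": (6.0, 6.1),
    "c": (4.1, 4.0), "d": (4.2, 4.3), "f": (3.9, 4.0),
}
example_groups = assign_matched_pairs(example_scores, norm=4.8)
print(example_groups)
```

Randomizing within pairs, instead of across the whole level, keeps the experimental and control groups balanced on initial reading ability by construction.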

While the Ss were not matched across 
the rows on an intelligence test as they 
were on the two reading measures, the t 
test for matched groups revealed that the 
above groups could not be viewed as be- 
ing other than from a common popula- 
tion also with respect to California (57 
S-Form) IQs. 

The eight prospective teachers of the 
above groups agreed to strict randomiza- 
tion with respect to teaching fast and 
slow experimental and control groups.¹
Randomization was carried out by drawing
class assignments from a hat.

¹ The writers wish to thank the fifth- and
sixth-grade teachers of the Dundee, Michigan,
elementary schools whose keen desires
for experimentation made this study not
only possible, but mandatory. Especially
helpful was Harry Nelson, Superintendent,
who gave both financial and administrative
assistance.

The plan in operation called for teachers
of the control groups to teach reading
as they always had in their traditional
self-contained classrooms. The
experimental groups separated at a set 
hour, and each S went to a reading class 
of his or her own level. A reading period 
of 50 minutes was set for both the experi- 
mental and control groups. This period 
was carefully controlled for all groups and 
reading for all groups was carried out at 
approximately the same morning hour. 
The plan was in operation for the entire 
1958-1959 academic year.²


LIMITATIONS OF STUDY


Because this was a single-school experi- 
ment, several limitations that were in- 
herent in the nature of the design could 
not be escaped. 

1. The factor of control groups meant 
that the time of four teachers was used 
up for these groups. This seriously limited 
the number of experimental reading 
groups of different levels that could be 
formed. For this reason, advanced read- 
ing groups beyond the intact above aver- 
age sixth-grade group could not be formed. 
Thus, with the exception of having five 
highly gifted fifth graders enter the group, 
this group was not truly an experimental 
group. 

2. The use of two levels of initial read- 
ing achievement meant that the experi- 
mental groups were not matched against 
truly heterogeneous control groups, but 
instead were matched against groups that 
were already somewhat homogeneous with 
respect to reading. 

² During the summer of 1958, book carts
and supplementary reading materials were
purchased for all eight classes. A new and
unfamiliar basal reader series was purchased
for the experimental classes. All
participating teachers had equal access to a
well stocked storeroom of supplementary
readers, reading materials, and hundreds of
library books from the county library. The
participating teachers asked that no one
teacher be given any supervision, as they felt
that teaching conditions should remain the
same for all. None was given throughout the
entire year.


TABLE 1

SEX COMPOSITION, MEAN IQ, READING MEANS, AND STANDARD DEVIATIONS OF
FAST AND SLOW EXPERIMENTAL AND CONTROL GROUPS

                  Sex and IQ              California         Durrell
Group                          IQ        Mean     SD        Mean     SD
5th  S-E       13      7      96.45      4.17     .55       3.90     .50
     S-C       14      6      97.30      4.29     .39       3.83     .50
     F-E       14     13     110.81      5.48     ...       ...      ...
     F-C       10     17     109.00      5.67     .79       5.51     .71
     T-E       27     20                 4.92     .99       4.77     .95
     T-C       24     23                 5.08     .94       4.76     .97
6th  S-E       11      5      ...        ...      .72       4.78     .66
     S-C       11      5      98.01      5.39     .78       4.70     .61
     F-E       11     16     116.93      7.46     .92       7.30     .76
     F-C       16     11     117.67      7.49     .93       7.25     .71
     T-E       22     21                 6.70    1.31       6.36    1.42
     T-C       27     16                 6.70    1.34       6.30    1.40

Note.—S-E = Slow Experimental; S-C = Slow Control;
F-E = Fast Experimental; F-C = Fast Control;
T-E = Total Experimental; T-C = Total Control.


3. Because of the limitation imposed by
the use of control groups, a considerable
body of experimental Ss received reading 
instruction as well as all other instruction 
from the same experimental teacher. On 
the surface, it would appear that there 
was no difference in methods for these Ss. 
This limitation, however, is not as severe 
as it might at first appear, as the nucleus 
of Ss remaining in their homerooms were 
still experimental Ss, all of whom were
now receiving instruction in classes where
reading range had been reduced by several
grades. At the fast fifth-grade level, for
example, the placing of 13 Ss into reading
classes outside their homeroom reduced
the reading range from 4.8-9.2 to 5.5-7.2.
The Ss coming into this group from the
slow sixth grade were all within the 5.5
to 7.2 reading range. At the other levels,
a similar reduction in reading range was 
obtained. It was possible, however, to 
check on the above limitations by testing 
the null hypothesis that there was no dif- 
ference in reading results for those experi- 
mental Ss leaving their homerooms for 
reading instruction and their matched 
counterparts in the control classes. The
same hypothesis was stated for those ex- 
perimental Ss remaining in their home- 
rooms for reading instruction. Should sig- 
nificant results be obtained in favor of the 
experimental groups, failure to reject the 
null hypothesis of equal effectiveness of 
method for those experimental Ss leaving 
their homerooms would tend to lessen the 
meaning of any significant trends. 

4. When the above limitations are taken 
into consideration, it can only be con- 
cluded that this experiment is not a fair 
representation of the Joplin plan at its 
best. Because of these limitations, trends 
significant at less than the .10 level will 
be taken as trends indicating the worth- 
whileness of further experimentation. 


RESULTS 


At all levels, the t test for matched
groups and the Wilcoxon matched-pairs 
signed ranks test (Siegel, 1956), a non- 
parametric test, were used to test the 
null hypothesis that the two different 
teaching methods produced no significant 
difference in reading results. Since the 
number of Ss was small, and since there 
was no ground to support the assumption 
of normality, the nonparametric test will 
be regarded as the more critical test at 
the fast and slow levels. 
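
For readers unfamiliar with the nonparametric test named above, the statistic T of the Wilcoxon matched-pairs signed-ranks test can be sketched as follows. The sketch assumes the usual procedure in which zero differences are dropped and tied absolute differences share the average of their ranks; the function name and the scores are hypothetical and are not data from this study. Scores are expressed in tenths of a grade so that differences are exact integers.

```python
def wilcoxon_signed_rank_T(experimental, control):
    """Return (T, n): the smaller signed-rank sum and the number of nonzero pairs."""
    # Differences within matched pairs; pairs with no difference are dropped.
    diffs = [e - c for e, c in zip(experimental, control) if e != c]
    ordered = sorted(diffs, key=abs)

    # Rank the absolute differences, giving tied values their average rank.
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        j = i
        while j + 1 < len(ordered) and abs(ordered[j + 1]) == abs(ordered[i]):
            j += 1
        average_rank = ((i + 1) + (j + 1)) / 2  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1

    positive = sum(r for d, r in zip(ordered, ranks) if d > 0)
    negative = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return min(positive, negative), len(ordered)


# Hypothetical matched pairs, reading scores in tenths of a grade.
exp_scores = [58, 61, 52, 60, 55, 63]
ctl_scores = [54, 63, 50, 53, 55, 62]
T, n = wilcoxon_signed_rank_T(exp_scores, ctl_scores)
print(f"T = {T}, n = {n}")
```

The resulting T is compared with a tabled critical value for n pairs; a small T indicates that the differences run predominantly in one direction.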

Table 2 presents the reading means and 
levels of significance by both the t and 
Wilcoxon tests on an alternate form of 
the Durrell-Sullivan. For both the fifth 
and sixth grades the results are reasonably 
clear-cut. The reading means of the fast 
and slow experimental groups are signifi- 
cantly higher at well less than the .10 
level, with the exception of the fast sixth- 
grade level where none was expected (see 
Limitation 1). When the fast and slow 
levels are combined to form total experi- 
mental and control groups, the difference 
is significant at less than the .01 level at 
the fifth grade and at less than the .05 
level at the sixth grade. 

Table 3 presents the same analysis as
Table 2 on an alternate form of the California.
Here it may be seen that results
are in line with the results obtained on 
the Durrell-Sullivan. It is interesting to 
note that the means of the California are 
significantly higher than those of the Dur- 
rell-Sullivan at both the fast and slow 
levels. Since it was not the reading meas- 
ures that were initially matched, but 
rather the Ss themselves on the two inde- 
pendent measures, this finding has only 
academic interest. 
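
The t ratios in Table 3 can be checked against the printed means. The sketch below assumes that the column following the means is the standard error of the mean difference between matched groups, so that each t ratio is the difference in means divided by that standard error; the row labels and the function name are hypothetical, and the figures are those printed in Table 3.

```python
# (level, experimental mean, control mean, SE of mean difference, reported t)
TABLE3_ROWS = [
    ("5th slow", 5.83, 5.35, 0.181, 2.65),
    ("5th fast", 7.18, 6.89, 0.142, 2.04),
    ("5th total", 6.60, 6.23, 0.130, 2.85),
    ("6th slow", 6.58, 6.21, 0.203, 1.82),
    ("6th fast", 8.50, 8.30, 0.120, 1.66),
    ("6th total", 7.78, 7.52, 0.110, 2.36),
]


def t_ratio(mean_experimental, mean_control, se_difference):
    """t ratio for matched groups: mean difference over its standard error."""
    return (mean_experimental - mean_control) / se_difference


for level, m_exp, m_ctl, se, reported in TABLE3_ROWS:
    computed = t_ratio(m_exp, m_ctl, se)
    print(f"{level:10s} computed t = {computed:.2f}   reported t = {reported:.2f}")
```

Each computed ratio agrees with the printed value to within rounding of the means, which supports reading that column as the standard error of the difference.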

TABLE 2

DURRELL-SULLIVAN READING MEANS AND TESTS OF SIGNIFICANCE BY THE t AND
WILCOXON TESTS


TABLE 3

CALIFORNIA READING MEANS AND TESTS OF SIGNIFICANCE BY THE t AND WILCOXON TESTS

                       California              t Ratios            Wilcoxon Test
Group        Number      Mean       SE of diff.    t       p         T       p
5th  S-E       20        5.83         .181        2.65    <.02      39      <.02
     S-C       20        5.35
     F-E       27        7.18         .142        2.04    <.06      80      <.05
     F-C       27        6.89
     T-E       47        6.60         .130        2.85    <.01
     T-C       47        6.23
6th  S-E       16        6.58         .203        1.82    <.10      31.5    <.07
     S-C       16        6.21
     F-E       27        8.50         .120        1.66    >.10     121      >.10
     F-C       27        8.30
     T-E       43        7.78         .110        2.36    <.05
     T-C       43        7.52


Turning next to Limitation 3, we find
that the 23 fifth-grade Ss leaving their
homerooms for reading instruction made
significantly higher mean reading scores
on both reading measures than did their
matched counterparts in the control
classes. The means for the Durrell-Sullivan
were 5.68 to 5.40, and 6.33 to 5.90
for the California. The null hypothesis of
equal effectiveness of teaching method was
rejected at less than the .05 level by the
Wilcoxon test for both reading measures.
The null hypothesis was also rejected at
less than the .05 level in favor of the remaining
24 experimental Ss who received
reading instruction in their homerooms.
At the sixth-grade level only 14 Ss left 
their homerooms for reading instruction 
(see Limitation 1). Of these 14 experi- 
mental Ss, 11 made higher reading scores 
on both tests, which called for a rejection 
of the null hypothesis at less than the .05 
level by the Wilcoxon test. Thus, with 
respect to Limitation 3, it can be con- 
cluded that the experimental plan of 
reading was approximately as effective 
for those Ss leaving their homerooms for 
reading instruction as it was for those Ss 
remaining in their homerooms. 
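
The count of 11 out of 14 Ss can also be examined with a sign test, which is a cruder substitute for the Wilcoxon test the authors actually used, since it ignores the size of each difference. The sketch below assumes a one-tailed test, assumes that the three remaining Ss are counted as not scoring higher, and assumes that under the null hypothesis each S is equally likely to score higher or lower than the matched control; the function name is hypothetical.

```python
from math import comb


def sign_test_upper_tail(successes, n):
    """P(X >= successes) when X is binomial with n trials and p = 1/2."""
    return sum(comb(n, k) for k in range(successes, n + 1)) / 2 ** n


# 11 of the 14 sixth-grade Ss leaving their homerooms scored higher.
p_one_tailed = sign_test_upper_tail(11, 14)
print(f"one-tailed sign test p = {p_one_tailed:.4f}")
```

Under these assumptions the one-tailed probability falls below .05, which points in the same direction as the Wilcoxon result reported above.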


DISCUSSION


Attempts to explain the obtained su- 
periority of the experimental plan are of 
course at this time highly conjectural. 
Nevertheless, we would like to suggest the 
following reasoning. The placing of stu- 
dents into reading groups where the read- 
ing range and variance have been greatly 
reduced means that the individual student 
can safely and securely face up to his 
reading problem without losing face. The
slow student is now functioning in
a nonthreatening group of other problem
readers who are experiencing problems
similar to his. It is our hypothesis that
this nonthreatening situation allows the
student to receive maximum positive reward
or feedback from reading material
he can now read. This rarely happens in
heterogeneous competitive groups at
Grades 4, 5, and 6 where grouping within
the class is practically nonexistent. Such
grouping also allows the teacher to develop
better techniques to handle the restricted
reading range and thereby maximizes
her opportunities to offer verbal and
emotional rewards. Once again, however, 
it is our hypothesis that the nonthreaten- 
ing group atmosphere which allows for a 
maximum positive feedback from readable 
materials is the more important factor 
of the two. The advantages for the high 
achievers are so obvious that they need 
not be discussed. 

The results obtained do appear to offer 
rather clear support for the notion that 
the Joplin plan is a more effective plan 
of reading than the traditional plan of 
self-contained classrooms with reading 
ranges of from six to eight grades at 
Grades 4, 5, and 6. In actuality, this sug- 
gests that teaching or more probably 
learning effectiveness increases as the 
group becomes more truly homogeneous. 

Concerning this experiment, care must
be taken not to generalize the significant
findings to other schools. Further experimentation
may well show that the plan
is not equally effective for all schools. 
Should other investigators obtain results 
similar to these, especially in multischool 
designs, an intensive analysis or observa- 
tion of these groups at work should prove 
productive in isolating some of the teach- 
ing and learning variables that make for 
more efficient learning, something we still 
know little about, despite a host of aca- 
demic learning experiments. 


SUMMARY


At the conclusion of Grades 4 and 5, 
180 of 226 Ss were matched separately on 
the Durrell-Sullivan and California read- 
ing achievement tests. The Ss of each 
grade scoring above and below the ex- 
pected grade norms of 4.8 and 5.8 were 
designated fast and slow achievers. The 
fast and slow Ss of each grade were then 
randomly assigned to experimental and 
control groups, which resulted in matched 
control and experimental groups on two 
levels. Experimental and control groups 
at all levels were also found to be from 
a common population on the California 
Test of Mental Maturity. 

Teachers were assigned to these groups
at random.
Supplementary reading materials and 
teacher supervision were equalized. The 
experimental Ss received reading instruc- 
tion according to the Joplin plan. The 
control Ss received reading instruction in 
traditional self-contained classrooms. The 
experiment ran for one year, at which 
time the experimental groups at all levels 
made significant gains. It was concluded 
that, for the single school where the ex- 
periment was run, the Joplin plan was 
more effective. It was suggested that the 
obtained superiority might be due to the 
lowered variance of the experimental 
groups, which allowed the teacher to offer 
more effective verbal and emotional re- 
wards, but more particularly allowed the 
slow student to function in a nonthreaten- 
ing atmosphere which maximized positive 
feedback from readable materials. 
A NOTE ON THE TEACHING OF SKINNER’S
DESCRIPTIVE BEHAVIORISM IN
THE FIFTH GRADE¹


H. E. KLUGH, W. A. DETERLINE
Alma College


AND D. E. HENDERSON
East Jordan Public Schools


The work of Ojemann, Levitt, Lyle,
and Whitside (1955), Stiles (1950), and
others has demonstrated that teaching elementary
school children a “dynamic” or
“causal” approach to behavior can be successfully
accomplished with appropriately
trained teachers and materials. Psychology
taught in the elementary grades thus far
has had, primarily, a mental health emphasis.
In this paper we will report the
results of a rather informal preliminary
program for the inclusion of a section on
psychology as an integral part of the elementary
school natural science course.

¹ Presented at American Psychological Association
meeting, Cincinnati, September, 1959.


PROCEDURE


The third author (Henderson), who did
the teaching described, had one semester’s
previous experience teaching elementary
school, and had completed a minor in psychology
as an undergraduate. One of the
courses making up this minor was Analysis
of Behavior which used Principles
of Psychology by Keller and Schoenfeld
(1950), and included a three-hour weekly
laboratory in which each pair of students
demonstrated principles of learning using
a white rat, a Skinner box, and a laboratory
manual by Homme and Klaus (1957).
Henderson, after listening to some of the
questions his elementary students posed
concerning animal behavior, decided that
a section might be added to his natural
science program incorporating many of the
experimental studies described in his college
manual.

The experimental class consisted of 33 
unselected fifth-grade children. The mean 
California IQ was 100. A 12-item pretest
covering the proposed material was administered
to the experimental and to a
control class of 34 children whose mean
California IQ was 104. The means of the
12-item pretest indicated no significant
pretreatment differences.

The method of procedure was to begin
the class period with a brief discussion of
the phenomena to be demonstrated. The
children then grouped around the teacher’s
desk on which was a Skinner box equipped
with a manually operated water reinforcing
device. Students acted as timers and
recorders, and manipulated light Ss and
reinforcement. In meetings of about 30-min.
duration twice a week over an eight-week
period the instructor discussed and
demonstrated reinforcement, extinction,
spontaneous recovery, discrimination, generalization,
the concept of S^D and S^Δ, and
the development of operant chains. At
the conclusion of each demonstration the 
data were recorded on a blackboard, and 
all children produced graphs showing the 
appropriate functional relationships. This 
was the first use of graphs for these chil- 
dren. Emphasis throughout the demon- 
strations was on careful observation of the 
animal’s behavior and its possible relation- 
ship to both controlled and uncontrolled 
external conditions. The children noted, 
somewhat in advance of the instructor, the
effect of a stop watch click acting as a
possible secondary reinforcer. At the end 
of each demonstration the discussion pe- 
riod attempted to show how the principle 
which operated with the rat might be used 
to predict and explain aspects of the be- 
havior of pets, the children themselves, or 
parents and teachers. These discussion ses- 
sions were recorded on tape and sent to the 
college where suggestions and comments 
were added and the tapes returned. Mid- 
way through the program a chapter of 
reading material (Klugh, 1959) was circu- 
lated to the children, and this was ap- 
parently of some assistance. 


RESULTS AND COMMENT 


The results of a readministration of the 
12-item achievement test indicated a sig- 
nificant increase in knowledge of and 
verbal application of concepts for the 
experimental class (t = 3.2, df = 60,
p < .01). The control group posttest
showed a slight decline in mean test score 
from the pretest condition. It is difficult 
to evaluate such a program. The authors 
were impressed by the great enthusiasm 
the children showed for the course, the 
care with which they made their observa- 
tions, and the ease with which they could 
ask incredibly difficult questions. The authors
feel that with appropriately trained
teachers, apparatus, and reading material
there is reason to believe that rudimentary
understanding of the simpler functional re- 
lationships in behavior can be taught in 
the science program of the later elemen- 
tary grades and that such a program will 
be enthusiastically received by the chil- 
dren. 
SUMMARY


An unselected fifth-grade class of 33 
children served as subjects in an explora- 
tory study of the limits and possibilities of 
teaching simple concepts of Skinner’s be- 
haviorism in the elementary grades. Skin- 
ner box demonstrations were given twice 
a week as part of the regular science pro- 
gram. Concepts demonstrated were rein- 
forcement, extinction, secondary rein- 
forcement, discrimination, generalization, 
partial reinforcement, and chaining. Chil- 
dren operated the apparatus, drew graphs, 
and took part in discussions relating the 
concepts demonstrated to human behavior. 
The children were enthusiastic, interested, 
and apparently acquired an elementary 
understanding of these concepts in terms 
of human behavior, as measured by a 
brief achievement test. 
THE VALIDITY OF THE MINNESOTA TEACHER
ATTITUDE INVENTORY


OWEN SCOTT


This study investigated the validity, predictive
and concurrent, of the Minnesota
Teacher Attitude Inventory (MTAI) for 
student teaching. Data obtained in the 
validity study were also used to investigate 
changes in the attitudes of student teachers 
during student teaching. 


SUBJECTS AND PROCEDURE 


Eighty-two student teachers enrolled in 
seven white teacher education institutions 
in Georgia participated. These student 
teachers were enrolled for student teach- 
ing on a full-time basis for approximately 
13 weeks. Data were also obtained from 
the supervising teachers of these student 
teachers and from 1,973 public school 
pupils. Participation was voluntary with 
teachers and pupils in Grades 4 through 
12 included. Two-thirds of both the teach- 
ers and the pupils were in Grade 9 and 
above. Subjects taught by the high school 
teachers included English, mathematics, 
science, social science, and home eco- 
nomics. 

The study of attitude changes employed 
three sets of data: MTAI scores of student 
teachers obtained within 10 days prior to 
student teaching, MTAI scores of their 
supervising teachers obtained early in the 
student teaching period, and MTAI scores 
of student teachers within two or three 
days after completion of student teach- 
ing. For the predictive and concurrent 
validity studies, these MTAI scores were 
used, and, in addition, scores obtained 
on a paper-and-pencil measure of student- 
teacher-pupil personal relations were uti- 
lized as criterion measures. 

Descriptions of teacher behavior and of 
the relationship of teacher behavior to
MTAI score contained in the MTAI manual
indicate that teachers scoring high on
MTAI tend to establish classroom personal
relations customarily described as
“democratic,” while low scoring teachers
tend to establish relations usually characterized
as “undemocratic” (Cook, Leeds,
& Callis, 1951, pp. 3-4). The selection or
construction of a criterion instrument for
measuring “democratic personal relations
in the classroom” posed quite a problem.
After a careful study of relevant literature
and of many instruments, the investigators
decided to describe “democratic personal
relations in the classroom” and to devise
an instrument based on this description
(Bryan & Yntema, 1939; Grim & Hoyt,
1952; Ryans, 1952; Ryans & Wandt,
1952). For purposes of this study the
phrase means that the student teacher is
successful in (a) creating a friendly classroom
atmosphere, (b) establishing in pupils
(and possessing himself) a feeling of
psychological security, (c) exerting a stabilizing
influence on pupil behavior, (d)
developing pupil self-responsibility, (e)
helping pupils understand and accept social
controls, (f) understanding and accepting
each pupil as a person, (g) helping
each pupil establish a feeling of his
own personal worth, and (h) treating
pupils in consistent fashion from day to
day.

Using this reference frame, the investigators
constructed a 15-item device to
be answered by pupils, designated as the
Classroom Personal Relations booklet
(CPR). The first 14 items are concerned
specifically with personal relations in the
classroom, while the 15th is a “global” item
inquiring about the pupil’s estimate of the
student teacher’s over-all teaching effectiveness
in comparison with other teachers
the pupil has had. For each item the five 
possible responses are designed to lie at 
approximately equal intervals along a con- 
tinuum from “effective” to “ineffective” 
in terms of the aspect of personal relations 
dealt with. The Likert method of scoring 
using arbitrary weights was adopted, us- 
ing weights of 1, 2, 3, 4, and 5 (Likert, 
1932). Seven of the first 14 items were 
randomly selected to be scored in reverse 
with Option 1 scored as 5, etc. The arith- 
metic mean CPR score of a class taught 
by the student teacher was selected as the 
criterion measure. 
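
The scoring rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the article does not report which seven items were reversed, so the set `REVERSED_ITEMS` is hypothetical, and the function names and the two answer sheets are invented for the example.

```python
# Hypothetical choice: the article does not report which seven of the
# first 14 CPR items were scored in reverse.
REVERSED_ITEMS = {2, 3, 6, 8, 9, 11, 14}


def score_response(item_number, option):
    """Return the Likert weight (1-5) for the option a pupil marked."""
    if option not in (1, 2, 3, 4, 5):
        raise ValueError("option must be 1 through 5")
    # Reversed items score Option 1 as 5, Option 2 as 4, and so on.
    return 6 - option if item_number in REVERSED_ITEMS else option


def cpr_score(options):
    """Sum the weights for items 1-14 (possible range 14 to 70)."""
    if len(options) != 14:
        raise ValueError("expected responses to the first 14 items")
    return sum(score_response(i, opt) for i, opt in enumerate(options, start=1))


def class_mean_cpr(answer_sheets):
    """Arithmetic mean CPR score of one class, i.e. the criterion measure."""
    return sum(cpr_score(sheet) for sheet in answer_sheets) / len(answer_sheets)


# Two made-up answer sheets, for illustration only.
sheets = [[5] * 14, [3] * 14]
print(class_mean_cpr(sheets))
```

Reversing half of the items means that a pupil who marks the same option throughout cannot obtain an extreme total, which is presumably why the reversal was used.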

In addition to using the reference frame 
referred to above in devising items, the 
investigators had each item carefully 
checked by selected elementary and high 
school teachers, by a number of specialists 
in elementary and secondary education, 
and by some pupils. Precautions taken to 
make probable the validity of CPR were 
at least as adequate as those taken with 
other instruments used in making validity 
studies of MTAI (Cook & Leeds, 1947; 
Leeds, 1950; Sandgren & Schmidt, 1956). 

The equivalent halves method of esti- 
mating the coefficient of internal con- 
sistency was employed, using the first 14 
CPR items. The split-half r was .72; the 
reliability coefficient, using the Spearman- 
Brown formula, was .84 (McNemar, 1955, 
pp. 156-157). Standard deviations of the 
distributions of CPR scores were found to 
be between 3 and 4 (range of possible 
scores, 14-70), so that the standard errors 
of measurement were in the neighborhood 
of 1.6, and the standard error of measure- 
ment of the mean, approximately 0.3. 
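
The arithmetic behind these reliability figures can be checked with a short Python sketch. The Spearman-Brown step and the standard error of measurement follow the usual textbook formulas; the class size of 25 used for the standard error of the class mean is an assumption (1,973 pupils spread over 82 student teachers suggests roughly 24 per class), not a figure given in the article.

```python
import math


def spearman_brown(r_half):
    """Step a split-half correlation up to full-test reliability."""
    return 2 * r_half / (1 + r_half)


def standard_error_of_measurement(sd, reliability):
    """SEM = SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)


r_full = spearman_brown(0.72)                         # about .84
sem_low = standard_error_of_measurement(3.0, 0.84)    # about 1.2
sem_high = standard_error_of_measurement(4.0, 0.84)   # about 1.6

# Assumed class size; the article does not state pupils per class.
ASSUMED_CLASS_SIZE = 25
sem_of_class_mean = sem_high / math.sqrt(ASSUMED_CLASS_SIZE)  # about 0.32

print(round(r_full, 2), round(sem_low, 1), round(sem_high, 1),
      round(sem_of_class_mean, 2))
```

With standard deviations between 3 and 4, the standard error of measurement falls between about 1.2 and 1.6, so the article's figure of 1.6 corresponds to the upper end of that range.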

At each of the seven teacher education 
institutions a project coordinator, usually 
the college supervisor working with the 
supervising and student teachers, was 
designated to obtain voluntary partici- 
pants, to administer MTAI both to stu- 
dent teachers and to supervising teachers, 
to score the CPR responses, and to tabulate
the data on specially prepared data
sheets. Supervising teachers administered
CPR after the student teachers had com- 
pleted student teaching and left their 
schools. In the instances of student teach- 
ers who taught more than one class of 
pupils, either the class in which the stu- 
dent teacher had done the most teaching 
was selected, or, if the student teacher had 
taught about the same length of time in 
several classes, one class was randomly se- 
lected to respond to CPR. 


CHANGES IN THE ATTITUDES OF 
STUDENT TEACHERS DURING
TEACHING 


Of many factors possibly associated 
with observed changes in attitude (e.g.,
influence of prior professional courses, 
factors traceable to the home environ- 
ments of the student teachers, etc.) only 
one factor, the attitude of the supervising 
teachers, was considered in this study. 
Data were available for 77 student teach- 
ers. The mean attitude change of this 
group was determined and tested, the 
mean attitude changes of two sets derived 
from this group were similarly examined, 
and investigation was also made of indi- 
vidual changes in attitude of the student 
teachers. Table 1 contains data pertinent 
to these analyses. 

All three of the student pretest-super- 
visor mean differences in Table 1 are sig- 
nificant. However, only one of the stu- 
dents’ pre- and posttest mean differences 
is significant—that for the 47 student 
teachers whose initial MTAI scores were 
in each instance lower than those of their 
supervising teachers. Of the six r’s in the 
last two rows, three are significant with 
only one being significant at the .01 con- 
fidence level. These statistics seem to show 
some slight degree of association between 
attitude changes of student teachers and 
the attitudes of their supervising teachers, 
with the association limited to instances 
in which the student teachers made lower 
initial scores than did their supervising 
teachers. 

TABLE 1

CHANGES DURING STUDENT TEACHING IN MEAN MTAI SCORES OF 77 STUDENT TEACHERS
AND OF TWO SETS DERIVED FROM THE GROUP

                                           Students with initial scores
                                           Lower than     Higher than
                                           supervisors’   supervisors’   Total group
                                           (N = 47)       (N = 30)       (N = 77)

Initial MTAI scores of students
  Mean                                       22.8           56.0           35.7
  Standard error of mean                      2.8            3.4            2.8
Poststudent teaching MTAI scores
  Mean                                       34.7           53.9           42.2
  Standard error of mean                      3.5            4.4            2.9
MTAI scores of supervising teachers
  Mean                                       62.7           34.3           51.6
  Standard error of mean                      3.1            3.1           [?].7
Differences: Means of students’ pre-
  and posttests (MTAI)                       11.9**         -2.1            6.5
Differences: Means of students’ pretests
  and supervisors’ MTAI scores              -39.9**         21.7**        -15.9**
Correlation: Students’ pre- and
  posttests (MTAI)                            .60**          .60**          .60**
Correlation: Students’ pretests with
  supervisors’ MTAI                           [?]            [?]            .27
Correlation: Students’ posttest with
  supervisors’ MTAI                           .34*           .34            .34**

Note. Correlations are Pearson r’s; linearity of regression was tested in each instance.
* Significant at .05 level of confidence.
** Significant at .01 level of confidence.


Analysis of changes in attitude of individual
student teachers corroborates the
association just mentioned. Nineteen of 
the 77 student teachers made statistically 
significant gains in score (.05 level of con- 
fidence), 12 underwent significant losses, 
and 46 underwent no significant change. 
Fourteen of the 19 student teachers mak- 
ing significant gains were in the set of 47, 
while 8 of the 12 undergoing significant 
losses were in the set of 30. The chi square
(8.08 with 2 df) computed for these data
is significant beyond the .05 level of confidence,
the major contribution to it being
the number of significant positive changes
in attitude in the set of 47. The contingency
coefficient (N of 77) turns out to
be .31. For the 31 students who underwent
significant changes in attitude during
student teaching, the r between their
attitude changes and the differences be- 
tween their initial scores and the scores 
of their supervising teachers is .42, sig- 
nificant at the .01 level of confidence. 
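
The contingency coefficient reported above follows directly from the chi square and the number of cases. A minimal Python check, assuming the usual Pearson definition C = sqrt(chi square / (chi square + N)), which the article does not spell out:

```python
import math


def contingency_coefficient(chi_square, n):
    """Pearson's contingency coefficient, C = sqrt(chi2 / (chi2 + N))."""
    return math.sqrt(chi_square / (chi_square + n))


# Values taken from the text: chi square of 8.08 with N of 77.
c = contingency_coefficient(8.08, 77)
print(round(c, 2))
```

The result rounds to .31, which agrees with the coefficient given in the text.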


PREDICTIVE AND CONCURRENT 
VALIDITIES OF THE MTAI 


The predictive validity study focused
on the question, “To what extent will the
attitudes of student teachers toward pupils
prior to student teaching (measured
by MTAI) predict the personal relations
with pupils they actually establish during
student teaching?” The concurrent validity
study was concerned with the question,
“To what extent are attitudes toward
pupils expressed by student teachers
during student teaching (measured by
MTAI) accompanied by the establishment
of corresponding personal relations with
pupils?” Answering these two questions
necessitated the construction of the criterion
device previously described.

One of the statistics used to determine
both the predictive validity and the concurrent
validity of MTAI was the partial
correlation coefficient indicating the relationship
between student teachers’
MTAI scores and the criterion measure, 
with supervising teacher MTAI scores held 
constant. For the predictive validity study 
the student teachers’ MTAI scores just 
prior to student teaching were used; for 
the study of concurrent validity, the stu- 
dent teachers’ MTAI scores immediately 
after student teaching were employed. For 
both studies, mean class scores on the 
first 14 CPR items were used as cri- 
terion measures. Table 2 presents a sum- 
mary of data relevant to these two va- 
lidity investigations. Data contained in 
Table 2 clearly indicate that neither the 
predictive validity coefficient nor the con- 
current validity coefficient was signifi- 
cantly different from zero at the .05 
confidence level. 
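
The partial correlation used here can be illustrated with the standard first-order formula. The sketch below is an assumption-laden illustration rather than a reproduction of the original computation: the zero-order correlation between student teachers' MTAI scores and mean CPR scores is not legible in Table 2, so the value `R_STUDENT_CPR` is hypothetical; the other two correlations (.27 and .01) are the predictive-study values from the table.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return numerator / denominator


# x = student teachers' MTAI, y = pupils' mean CPR, z = supervisors' MTAI.
R_STUDENT_CPR = 0.03         # hypothetical: not legible in the source table
R_STUDENT_SUPERVISOR = 0.27  # from Table 2 (predictive study)
R_SUPERVISOR_CPR = 0.01      # from Table 2 (predictive study)

r_partial = partial_correlation(R_STUDENT_CPR, R_STUDENT_SUPERVISOR,
                                R_SUPERVISOR_CPR)
print(round(r_partial, 2))
```

Because the supervisors' scores were almost uncorrelated with the criterion, holding them constant changes the student-criterion correlation very little.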

TABLE 2

PREDICTIVE AND CONCURRENT VALIDITY COEFFICIENTS
(AND OTHER RELEVANT FACTS) OBTAINED FOR THE MTAI

                                          Predictive   Concurrent
                                          (N = 74)     (N = 69)

Student teachers’ MTAI scores
  Means                                     35.2         42.2
  Standard deviations                       34.2         34.4
Supervising teachers’ MTAI scores
  Means                                     51.6         51.6
  Standard deviations                       32.7         32.7
Pupils’ CPR scores
  Means                                     58.7         58.8
  Standard deviations                        3.0          3.0
Correlations: Student teachers’ and
  supervisors’ MTAI scores                   .27         [?]
Correlations: Student teachers’ MTAI
  and pupils’ mean CPR scores               [?]          [?]
Correlations: Supervisors’ MTAI and
  pupils’ mean CPR scores                    .01         -.01
Partial correlation: Student teachers’
  MTAI and pupils’ CPR scores with
  supervisors’ MTAI held constant            .03         -.04

Note. For each r linearity of regression was tested.


Although MTAI had neither predictive
nor concurrent validity in terms of mean
responses to the first 14 CPR items taken as
a whole, possibly it had validity in terms
of responses to individual CPR items. In
both the predictive and concurrent validity studies,
item analyses were made for each CPR
item. The 10 student teachers scoring highest
on MTAI and the 10 scoring lowest
were selected in each study, and CPR item
responses were analyzed to determine
which items differentiated significantly between
the two groups of student teachers.
In the predictive validity item analyses,
the responses to each item of the 252
pupils in the classes of the 10 highest scoring
(on MTAI) student teachers were
treated initially as one distribution, and
the responses of the 263 pupils in the
classes of the 10 lowest scoring student
teachers were similarly considered. Item
analyses in the concurrent validity study,
using the responses to each item of the 242
pupils in the classes of the 10 highest scoring
(on MTAI) student teachers and of
the 237 pupils in the classes of the 10
lowest scoring student teachers, were made
in like fashion. In the analyses, responses
scored as 5, 4, or 3 were considered “favorable”;
those scored as 2 or 1, as “unfavorable.”
The percentage of favorable
responses in each distribution was computed,
and the significance (.01 level of
confidence) of the difference between per- 
centages was tested (McNemar, 1955, pp. 
60-61). Table 3 lists for both the pre- 
dictive and concurrent validity studies 
the items and percentages for which the 
difference in percentages of favorable re- 
sponses was significant. 
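
The test of a difference between two percentages can be sketched as follows. The article cites McNemar without giving the formula, so the pooled-variance z test below is an assumption about the method, and the function name is invented. The figures are the CPR Item 14 percentages from the predictive study as read from Table 3 (93% of 252 pupils versus 76% of 263 pupils).

```python
import math


def two_proportion_z(p_high, n_high, p_low, n_low):
    """z statistic for the difference between two independent proportions,
    using the pooled proportion to estimate the standard error."""
    pooled = (p_high * n_high + p_low * n_low) / (n_high + n_low)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n_high + 1 / n_low))
    return (p_high - p_low) / standard_error


Z_CRITICAL_01 = 2.576  # two-tailed critical value at the .01 level

z = two_proportion_z(0.93, 252, 0.76, 263)
print(round(z, 2), abs(z) > Z_CRITICAL_01)
```

Like the original analysis, this treats every pupil's response as an independent observation, even though pupils are grouped within the classes of only 10 student teachers per group.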

TABLE 3

CRITERION ITEMS FOR WHICH DIFFERENCES IN PERCENTAGE
OF FAVORABLE RESPONSES WERE SIGNIFICANT

                                          Predictive*      Concurrent**
CPR item                                   PH     PL        PH     PL

 1. How do you feel toward your
    student teacher?                       79     70        76     60
 2. How does your student teacher
    feel toward you?                       83     72        85     77
 4. Are your classmates good
    friends with you?                      71     82         -      -
 8. How do you feel about disagreeing
    with your student teacher?             74     85        79     67
13. Does your student teacher treat
    you in about the same way
    every day?                             80     92        83     74
14. Does your student teacher
    understand you and your problems?      93     76        93     71
15. Does your student teacher teach
    as well as other teachers you
    have had?                               -      -        82     70

Note. For the predictive validity study, MTAI scores were obtained prior to
student teaching. For the concurrent validity study, these were obtained
immediately after student teaching.
NH: The number of pupils in the classes (one per student teacher) of the 10
student teachers scoring highest on MTAI.
NL: The number of pupils in the classes (one per student teacher) of the 10
student teachers scoring lowest on MTAI.
PH: The percentage of pupils responding favorably to the indicated CPR item
in the classes of the 10 student teachers scoring highest on MTAI.
PL: The percentage of pupils responding favorably to the indicated CPR item
in the classes of the 10 student teachers scoring lowest on MTAI.
* In the predictive validity study NH was 252 and NL, 263.
** In the concurrent validity study NH was 242 and NL, 237.


In the predictive validity item analyses
six items were found to discriminate
significantly: some positively; others, negatively.
For each of these six items, 70% or more of the pupils
in both groups responded favorably so that
discriminations were in terms of differences
in sizes of the majorities of favorable responses.
The items which discriminated
positively ask specifically about the interpersonal
relations between pupil and student
teacher; the ones which discriminated
negatively ask questions about the friend- 
ship of fellow pupils, about feelings con- 
cerning disagreeing with the student 
teacher, and about the consistency with 
which the student teacher treats the pupil. 

Six items were also found to discriminate
significantly, positively in each instance,
in the concurrent validity study.
As in the predictive validity study, all discriminations
were in terms of differences
in sizes of the majorities of favorable responses.
Inspection of the content of these
six discriminating items makes clear the
fact that they pertain to three of the eight
characteristics of “democratic personal relations
in the classroom” (defined in this
study): (a) creating a friendly atmosphere,
(b) treating the pupil in consistent
fashion from day to day, and (c) understanding
and accepting the pupil as a person.
Also involved (CPR Item 15) was
the pupil’s estimate of the student teacher’s
over-all teaching effectiveness.

Each of the two sets of responses to an
item considered as one distribution in
testing the significance of the difference in
percentages of favorable responses was, in
fact, a composite of 10 distributions. In
both the predictive and concurrent validity
studies, each of these composites (24
in all) was tested for homogeneity of vari- 
ance and for significance of the differences 
among means. Homogeneity of variance 
among the distributions comprising each 
composite was tested, using Bartlett’s chi
square, and the significance of differences
among means within each composite was 
tested, using an F test (Edwards, 1950, 
Chaps. 9 and 10). In each instance in 
the predictive validity study in which the 
variances were found to be heterogeneous, 
two independent transformations of the 
scores were effected (first, a square root 
transformation; then, a logarithmic trans- 
formation) because of the possible effects 
of heterogeneity of variance upon the dif- 
ferences among means (Edwards, 1950, 
pp. 197-203). In no instance did score 
transformation change the results of the 
analysis. Since the concurrent validity data 
were similar to those of the predictive 
validity study, no transformations were 
made in the concurrent validity study. In 
only one pair of composite distributions 
(from CPR Item 4 in the predictive va- 
lidity study) were differences among 
means found to be nonsignificant.

One may conclude from these data that,
except for CPR Item 4 in the predictive
validity study, both among the classes of
student teachers scoring highest on MTAI
and those of student teachers scoring
lowest, there were significant differences
in responses to those CPR items which
discriminated between the group of highest
scoring (MTAI) student teachers and the 
group of lowest scoring ones. In other 
words, intragroup differences were sig- 
nificant as well as intergroup differences. 


SUMMARY


Eighty-two student teachers enrolled 
in seven white teacher education institu- 
tions in Georgia responded to the Minne- 
sota Teacher Attitude Inventory (MTAI) 
both just prior to and just after a quarter 
of full-time off-campus student teaching.
Analyses of differences in prestudent teaching
and poststudent teaching responses
to MTAI were made, both for changes in
group means and for changes in indi- 
vidual scores. The group of student teach- 
ers working with supervising teachers 
whose attitudes toward pupils were, in 
each instance, superior to their own, im- 
proved significantly as a group in their 
attitudes toward pupils during student 
teaching (i.e., the change in the group’s 
mean score during student teaching was 
significant). The group working with 
supervising teachers whose attitudes were, 
in each instance, inferior to theirs, did not 
as a group change significantly in their 
attitudes toward pupils during student 
teaching. Further analyses, based on the 
attitude changes of individual student 
teachers, corroborated these findings. 

A purpose of the study was to examine 
the predictive and concurrent validities 
of MTAI, using mean pupil responses to 
a carefully constructed check list, the 
Classroom Personal Relations booklet 
(CPR) as criteria. With these criteria, 
MTAI had neither predictive nor con- 
current validity, a finding in conflict with 
some other studies (e.g., Hoyt & Cook, 
1959; Sandgren & Schmidt, 1956; Stein 
& Hardy, 1957; Stogdill & Coons, 1957). 
When responses to individual CPR items 
were used as criteria, in some instances
there were statistically significant differences
between student teachers scoring
highest on MTAI and those scoring low- 
est; however, these differences were of 
little practical significance.


A STUDY OF MEDIATION IN MENTALLY RETARDED
AND NORMAL SCHOOL CHILDREN¹


GERSHON BERKSON AND GORDON N. CANTOR
George Peabody College for Teachers 


Numerous experiments have provided 
evidence bearing on the mediation hypoth- 
esis, a notion which is playing an increas- 
ingly prominent role in S-R theory. One 
experimental paradigm found in such studies
utilizes three lists of stimuli which, for
present purposes, may be labeled A, B, and 
C. If the subject (S) learns paired-associ- 
ates involving Lists A and B, and then ones 
involving Lists B and C, mediation theory 
predicts that the subsequent learning of 
paired-associates involving Lists A and C 
will be facilitated due to the prior learning 
experience. Data relevant to this formula- 
tion have been reported on college students 
(Peters, 1935; Bugelski & Scharlock, 1952; 
Russell & Storms, 1955) and on normal 
preschoolers (Norcross & Spiker, 1958). 

The present investigation involved an at- 
tempt to demonstrate the occurrence of 
mediation in normal and mentally retarded 
elementary school children, using the A-B, 
B-C, A-C paradigm. An additional purpose 
of the study was to compare normal and 
retarded children in their ability to learn 
paired-associates and to perform a media- 
tion task. 

The experimental predictions were as 
follows: (a) the normal Ss would learn lists 
of paired-associates more quickly and with 
fewer errors than would the retarded Ss; 
(b) both the retarded and normal Ss would 
be capable of mediation behavior such as 
that described above; and (c) a difference 
in IQ level would not affect the degree of
facilitation achieved through the mediation
process.

¹ This investigation was partially supported
by the National Institute of Mental Health
through the mental deficiency training program
at George Peabody College for Teachers.
Acknowledgment is made to the Nashville,
Tennessee City School System for
permission to carry out the experiment in
the Nashville public schools.

The first prediction was based on the ob- 
servation that the learning of retarded in- 
dividuals is generally less efficient than that 
of normals. With respect to the second pre- 
diction, it was assumed that the retarded as 
well as the normal Ss could reach a learning 
criterion on the A-B and B-C pairs. Employing
this assumption, S-R theory would
lead one to expect facilitation in the learn- 
ing of the A-C list in both groups of Ss. 
Again, assuming the attainment of a com- 
mon learning criterion on the A-B and B-C 
lists for both groups, there would be no 
basis in S-R theory for expecting a dif- 
ferential amount of facilitation in the 
groups (i.e., with equal A-B and B-C habit
strengths built up in both groups, one
would anticipate an equal degree of facilitation
on the A-C list); this served as the
rationale for the third prediction.²
Assuming all three predictions were confirmed
by the results, one would have a
basis for contending that the teacher must
train retarded pupils longer to reach a
given level of verbal learning, but once that
level is achieved, mediation based on such
learning will occur in the same degree in
the retarded as in normals.

² An assumption is made here that the
groups acquire comparable habit strengths
on the A-B and B-C lists by virtue of reaching
the same learning criterion. Strictly
speaking, if the retarded Ss require more
trials to reach criterion and if habit strength
is defined solely in terms of number of reinforcements,
it would follow that the retarded
Ss approach the A-C list with greater
habit strength for at least some of the A-B
and B-C components. In this case, comparable
facilitation on the A-C list could be attributed
to such a habit differential. Because
of the results actually obtained on the A-B
and B-C lists, however, this complication is
largely eliminated in interpreting the results
of the present experiment.


METHOD


Subjects and materials. The Ss were 54 
children attending regular and special 
classes in the Nashville, Tennessee City 
School System. All Ss were Caucasian and, 
with one exception, were chosen from 
schools located in lower socioeconomic 
neighborhoods. The Ss were between 9 and 
12 yrs. old. The Ss in the normal group had 
IQs between 86 and 115. The mentally retarded
Ss had IQs between 55 and 85. The
instruments used for measuring intelligence 
were the Kuhlmann-Finch Test (Forms 
III and V). There were 30 Ss in the normal 
group and 24 Ss in the retarded group. 
Within each IQ level, Ss were assigned at 
random to experimental and control con- 
ditions. A summary of the characteristics 
of the groups may be seen in Table 1. 

No child was included in the experiment 
if his MA was below seven years, if he had 
been diagnosed as brain injured by a phy- 
sician, or if he could not meet the learning 
criterion used in the study. Four normal Ss 
and 12 retarded Ss were excluded from the 
study on the basis of one or more of these 
criteria. 

The materials consisted of three lists of 
paired stimuli. Each stimulus was pasted 
or painted on a 5 x 8 in. unlined filing card. 
The stimuli fell into four classes—the odd 
arabic numerals from 1 through 11, the 
even arabic numerals from 2 through 12, 
pictures of common objects, and hexagons 
varying in color. The common objects pic- 
tured were an automobile, a camera, an 
airplane, a baby, a girl, and a man. The 
hexagons were colored red, green, yellow, 
blue, purple, or orange. 

There were six pairs of stimuli in each of 
the three lists. A pair was made up of two 
cards set back to back and encased in a 
transparent, colorless celluloid envelope. 

TABLE 1

IQ, MA, AND CA OF THE GROUPS

                              IQ              MA (mo.)         CA (mo.)
Group                   N     M       SD      M       SD       M       SD

Experimental normal     15   100.33   7.99   130.80   7.71    130.93    6.03
Control normal          15    97.27   [?]     [?]     [?]     134.27    6.09
Experimental retarded   12    70.25   6.98    99.17   7.48    142.00   10.95
Control retarded        12    69.33   6.64    95.42   6.83    138.17   11.52


TABLE 2

DESIGN OF THE EXPERIMENT

                                        IQ level
                Normal                          Retarded
Group           List I   List II   List III     List I   List II   List III

Experimental    A-B      B-C       A-C          A-B      B-C       A-C
Control         X-B      B-C       A-C          X-B      B-C       A-C


Design. The design of the experiment is
presented in Table 2. The independent variable
was involved in the first list learned.
In List I, the stimulus cards for the experi- 
mental Ss were the odd numerals (A). The 
controls received the even arabic numerals 
(X) as stimuli. The response cards for both 
experimental and control groups were the 
pictures of common objects (B). 

Lists II and III were the same for the 
experimental and control groups. The stim- 
uli of List II were the objects (B), and the 
response cards were the colors (C). In List 
III, the stimuli were the odd numbers (A), 
and the responses were colors (C). 

Within Lists I and II, the pairing of 
stimulus and response cards was random. 
In List III, the numbers and colors were 
paired which, for the experimental group, 
had previously been paired with the same 
object. For example, if the number “three” 
had been paired with “man” in List I and 
“man” had been the stimulus for “yellow” 
in List II, then “three” and “yellow” were 
paired in List III.
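
The construction of the three lists can be sketched in Python as a composition of two random pairings. This is an illustrative sketch only: the function names, the use of a seeded random generator, and the dictionary representation are assumptions made for the example, not details of the original procedure.

```python
import random

ODD_NUMERALS = [1, 3, 5, 7, 9, 11]                                    # A
OBJECTS = ["automobile", "camera", "airplane", "baby", "girl", "man"]  # B
COLORS = ["red", "green", "yellow", "blue", "purple", "orange"]        # C


def random_pairing(stimuli, responses, rng):
    """Pair each stimulus with one response, chosen at random."""
    shuffled = list(responses)
    rng.shuffle(shuffled)
    return dict(zip(stimuli, shuffled))


def mediated_pairing(list_one, list_two):
    """Pair each A item with the C item that shares its B item."""
    return {a: list_two[b] for a, b in list_one.items()}


rng = random.Random(0)  # seeded only to make the sketch repeatable
list_i = random_pairing(ODD_NUMERALS, OBJECTS, rng)   # A-B
list_ii = random_pairing(OBJECTS, COLORS, rng)        # B-C
list_iii = mediated_pairing(list_i, list_ii)          # A-C

# The article's own example: "three"-"man" and "man"-"yellow"
# yield "three"-"yellow" in List III.
print(mediated_pairing({3: "man"}, {"man": "yellow"}))
```

For the control group, List I used the even numerals (X) instead of the odd ones, so the A-C pairs of List III had no previously learned A-B link to build on.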

The Ss learned each list to a criterion of 
five successive correct repetitions. No S
was included in the analysis of the results
who did not meet this criterion within 30 
presentations of the full list on any of the 
three lists. The Ss who were not included 
in the analysis of the results were replaced 
by random selection of other available Ss 
from the same population. 
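
The learning criterion and the exclusion rule can be expressed as a small Python function. The sketch assumes that a "correct repetition" means one errorless presentation of the full six-pair list and that the criterion presentations themselves count toward the trials score; the article states neither point explicitly, and the function name is invented.

```python
def trials_to_criterion(errorless, run_length=5, max_presentations=30):
    """Return the presentation on which the criterion was reached.

    errorless: sequence of booleans, one per presentation of the full
    list, True when every pair was answered correctly.
    Returns None when the criterion is not met within the limit, in
    which case the S would have been dropped and replaced.
    """
    streak = 0
    for trial, correct in enumerate(errorless[:max_presentations], start=1):
        streak = streak + 1 if correct else 0
        if streak == run_length:
            return trial
    return None


# Made-up record: three presentations with errors, then five errorless ones.
print(trials_to_criterion([False] * 3 + [True] * 5))
```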

Procedure. The Ss were led into the ex- 
perimental room, which was either an 
empty classroom or testing room in the 
school which S attended. The S was asked 
to sit in a chair on the opposite side of the 
desk from E. The E then read the following 
instructions to S: 


This is a game in which I want to see how 
well boys and girls can learn which things go 
together. On each of these cards there is a 
number or a picture or a color. 


The S was shown examples of the stimuli. 
At this point, all the odd numbers (A) were 
presented to the control Ss, and these Ss 
were asked to read them.* The E then pro- 
ceeded with the instructions as follows: 


In the first part of the game, the picture 
will be on the back of the number, like this. 
When I show you a number, you will have to 
guess which picture is on the back. Do you 
understand? Now, here are some numbers 
and the pictures which go with them. Tell me 
the name of the number and the picture as I 
show them to you. 


The E went through List I once with S. If 
S was unable to name one or more of the 
stimuli presented, he was not permitted to 
continue but was given the reward. Other- 
wise E continued with the instructions: 


Good. Now, if you learn what I want you 
to learn, I will give you a Hershey bar. O.K.? 
Can you tell me the name of the picture 
which goes with this number? 


*The control Ss were asked to read the
odd numbers in order to establish that they
had prior familiarity with these stimuli.

The paired-associates were presented
in a prearranged, random order which
changed with each successive presentation
of the list. When S made his first error, E
told him that he was working on a hard
task, that everyone makes mistakes, and
that he should not worry about making errors.
When S had completed List I, E asked
him to walk around the room while E rearranged
the cards. Introducing List II, E
said:

This time, the pictures will be on the front
and the colors will be on the back. Tell me
the names of the pictures and the colors as I
show them to you. Can you tell me which
color goes with this picture?


Following List II, S was again asked to 
walk around the room. Before List III, E
said:

This time the numbers will be on the front
and the colors on the back. Tell me the names
of the numbers and the colors as I show them
to you. Can you tell me which color goes with
this number?


When S completed the final list, he was 
given a chocolate bar and returned to his 
classroom. 


RESULTS 


Analysis of variance was applied to the 
criterion scores by means of a 2 x 2 factorial
design. The three measures analyzed
were CA, number of trials taken to reach 
criterion, and number of errors in each list. 
The Kolmogorov-Smirnov test of deviation 
from a hypothesized distribution was ap- 
plied for a test of normality of the data. 
The Hartley test was used to test the as- 
sumption of homogeneity of variance. No 
distributions were significantly different 
from normal. Evidence for heterogeneity 
of variance was found only in the error 
data of List I. In the subsequent analyses 
of variance, the .05 level of significance was 
used for all comparisons except those in- 
volved in the analysis of the errors of List 
I. The level of significance utilized for the 
errors in List I was .025 (Lindquist, 1953). 

The analysis of CA showed no significant 
difference between the ages of the experi- 
mental and control groups, but the retarded
Ss were significantly older than the
normals. The test of interaction was not 
significant.

To determine whether analysis of covariance
would be appropriate to control
for the differences found in CA, Pearson 
correlations were run between CA and the 
trials to criterion and error scores for each 
list. No significant correlations were ob- 
tained, and it was decided to proceed with 
the analyses of variance. 

Table 3 presents the results of List I.
The analysis of variance revealed no significant
differences in the learning of List
I either for trials to criterion or number of
errors. No significant interaction was observed.
The results of List II show a slightly
different pattern than did those of List I.
The means and SDs of List II may also be
seen in Table 3. While for the trials measure
there were again no significant differences
between any groups, the normal Ss did
make significantly fewer errors in learning
List II (F = 5.61, df = 1 and 50, P < .05).
No significant interaction was found.

Tables 3 and 4 show the results of List
III. In List III, the experimental Ss learned
significantly more quickly and with fewer
errors than did the control Ss. It may also
be seen that on both measures the normals
were more efficient than were the retarded
Ss. No significant interaction was observed
in the analysis of List III.


TABLE 3
RESULTS OF LISTS I, II, AND III

List   Group                    Trials            Errors
                                M       SD        M       SD
I      Normal-Experimental      16.07   3.01      21.87   6.94
       Normal-Control           16.40   3.36      16.53   4.68
       Retarded-Experimental    16.58   5.82      19.58   11.45
       Retarded-Control         17.50   6.02      22.25   10.67
II     Normal-Experimental      11.07   3.37
       Normal-Control           11.67   4.67      10.60   11.15
       Retarded-Experimental    13.83   4.16      18.58   9.39
       Retarded-Control         11.92   3.58      12.75   10.21
III    Normal-Experimental      8.93    3.81      7.27    10.32
       Normal-Control           13.73   15.39     16.33   12.94
       Retarded-Experimental    11.25   5.95      8.08    9.52
       Retarded-Control         17.67   5.45      30.08   18.13


TABLE 4
ANALYSIS OF VARIANCE OF LIST III

Source                      df    Trials             Errors
                                  MS       F         MS        F
Experimental vs. Control    1     411.13   15.49*    2962.96   17.54*
Normal vs. Retarded         1     130.20   4.91*     707.30    4.19*
Interaction                 1     8.71     0.33      557.65    3.30
Error                       50    26.54              168.96
*P < .05.


DISCUSSION


The first prediction, that normal children
would learn paired-associates more
efficiently than retarded Ss of the same
CA, was only partially confirmed. The
normals made fewer errors on List II and
were superior to the retarded Ss in both
measures on List III. The pattern of clearer
differences between normals and retarded
over trials suggests that the groups may
have been learning to learn paired associates
at different rates. It is possible that
the technique used in the present experiment
was too insensitive to detect differences
in learning ability initially. With additional
training, the advantage that the
normals may have had by virtue of being
better able to learn how to deal with the
task may have produced the differences
between the groups found on the second
and third lists. These results, however, do
make it possible to argue that the two
groups encounter List III with comparable
List I and List II habit strengths (see
Footnote 2).
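
The F ratios in Table 4 can be checked by dividing each mean square by the error mean square for the same measure (df = 1 and 50). The sketch below does only that arithmetic; the dictionary layout and the name `f_ratio` are illustrative, and the mean squares are copied from Table 4.

```python
def f_ratio(ms_effect, ms_error):
    """F ratio for a one-df effect: effect mean square over error mean square."""
    return ms_effect / ms_error


# Mean squares from Table 4 (List III), keyed by measure.
ms_error = {"trials": 26.54, "errors": 168.96}
ms_effect = {
    "Experimental vs. Control": {"trials": 411.13, "errors": 2962.96},
    "Normal vs. Retarded": {"trials": 130.20, "errors": 707.30},
    "Interaction": {"trials": 8.71, "errors": 557.65},
}

f_table = {
    source: {measure: round(f_ratio(ms, ms_error[measure]), 2)
             for measure, ms in row.items()}
    for source, row in ms_effect.items()
}

for source, row in f_table.items():
    print(source, row)
```

The recomputed values agree with the tabled F ratios to two decimals, which supports reading the two garbled entries in the Errors column as starred (significant) values.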

The results of List III show that both 
the normal and retarded mediation groups 
were significantly superior to the controls 
in speed and accuracy in learning the list. 
These results are consistent with the second 
prediction and suggest a confirmation of
the mediation hypothesis in the popula- 
tions from which the Ss in this study were 
drawn. The failure to find a significant in- 
teraction involving the intelligence groups 
and the experimental-control treatments 
indicates, in line with the third prediction, 
that there is no evidence in this experiment 
for a relationship between IQ and degree 
of facilitation associated with mediation. 
Thus, whenever classroom learning can be 
fitted to the A-B, B-C, A-C paradigm, the 
results suggest that facilitation on the A-C 
material due to the A-B, B-C training will 
occur, and in the same degree, in retarded 
as well as normal children as long as the 
A-B, B-C material is learned equally well 
in both groups. Of course, these generalizations
apply only to groups like those serving
as Ss in the present experiment.

A comment regarding a flaw in the de- 
sign of the experiment would seem to be in 
order. The conditions of the study per- 
mitted the experimental Ss fairly extensive 
experience with the odd numbers (List I) 
before these numbers were presented in 
List III. The control Ss, on the other hand, 
had gone through the list of odd numbers 
only once prior to being tested on List III. 
It might be argued that the advantage for 
the experimentals on List III could as eas- 
ily have been due to greater familiarity 
with the materials as to the mediation con- 
dition. 

While this interpretation is possible, it 
should be remembered that all Ss had had 
experience with odd numbers prior to the 
experiment and that no S was permitted to 
continue in the study who could not dem- 
onstrate a great degree of familiarity with 
all the elements. It may thus have been 
that all Ss were acquainted with the elements
to such a degree that additional experience
with the odd numbers did not substantially
add to their familiarity. If the
latter interpretation is accepted, one can 
conclude that a mediation process has been 
shown to aid in the learning of a list of 
paired-associates. 


SUMMARY


A group of 24 mentally retarded children
and a group of 30 normal children were
each divided into experimental and control
groups. Each S learned three lists of
paired-associates. The lists which the experimental
Ss learned were arranged in an
A-B, B-C, A-C paradigm. The control Ss 
learned Lists X-B, B-C, and A-C. The ex- 
perimental and control conditions differed 
only in regard to the nature of the stimuli 
of the first list. 

The results supported three conclusions:
(a) the normal children learned paired-associates
more efficiently than did the retarded
children, (b) the mediation (experimental)
condition facilitated the learning
of List III for both retarded and normal 
Ss, and (c) the degree of facilitation in the 
learning of List III was not significantly re- 
lated to the IQ level of the groups. 
COMPARABILITY OF PEABODY PICTURE
VOCABULARY TEST SCORES UNDER GROUP
AND INDIVIDUAL ADMINISTRATION*


RAYMOND C. NORRIS, JOHN V. HOTTEL, AND SADYE BROOKS
George Peabody College for Teachers


The present study was conducted in connection
with the development and standardization
of the Peabody Picture Vocabulary
Test (PPVT). This test consists of
150 plates containing four pictures each.
The pictures on any one plate all represent
objects or activities or states of being. In
administering the test the examiner provides
a stimulus word orally as in “Show
me ball.” The subject then points to, or in
some other way indicates, the picture on
the plate which best illustrates the meaning
of the stimulus word. The plates are arranged
in ascending order of empirically
determined difficulty, and the subject is
tested from a basal of eight consecutive
correct responses to a ceiling of six errors
on eight consecutive plates. The raw score
for a subject is his basal score plus the
number of items answered correctly up to
his ceiling. A more complete description
of the instrument has been reported by
Dunn (1959) in his manual.


PURPOSES


While the PPVT was designed to be used
as an individual examination, the test author
and his staff found it advantageous
in trying out possible test items on some
levels to administer the items to groups
rather than individuals. Photographic
slides of tentative item plates were projected
onto a screen, and classroom groups
of subjects marked their responses on separate
answer sheets. The procedure worked
so well that consideration was given to
the possibility of collecting the normative
data for normal children of late elementary
school age in the same manner. However,
there was no evidence that group testing
would yield scores comparable to those obtained
under individual administration.
The primary purpose of the present study
was to test the hypothesis that PPVT
scores obtained under individual and group
administrations would not differ significantly
or appreciably. If this hypothesis
were upheld, it was believed that the norm
sample could be increased greatly while
effecting considerable savings in time and
expense.

*The research reported herein was performed
pursuant to Contract SAE-6425,
United States Office of Education, Department
of Health, Education, and Welfare.
The generous cooperation of Lloyd M.
Dunn, Project Director, and his project staff
is gratefully acknowledged.

Two other purposes helped determine 
the design used in the current study. In the 
preparation of the PPVT, item plates were 
selected in such a way that two items of 
nearly comparable difficulty could be ob- 
tained from each page. These items were 
then assigned to Forms A and B in such 
a way as to equate difficulty of the forms 
for each year level. The authors were in- 
terested in finding out how successful this 
equating process had been and how com- 
parable scores on the two forms might be 
considered. 

The third purpose for the present study 
was the determination of how much prac- 
tice effect could be expected on a second 
administration of the test in alternate 
form. In assessing the outcomes of special 
training programs it seemed desirable that 
practice effect either be eliminated en- 
tirely or estimated very closely. In clinical 
evaluation the second form might be used 
for verification of diagnosis and knowl- 
edge of how much score discrepancy might 
be expected due to practice alone would 
be helpful. 
FIG. 1. Experimental design of test administrations.


The three major hypotheses to be tested
then were:

$H_1$: PPVT scores obtained under group
and individual administration do not
differ significantly ($\mu_G = \mu_I$).
$H_2$: PPVT scores based on Form A do
not differ significantly from those
based on Form B ($\mu_A = \mu_B$).
$H_3$: PPVT scores will not be significantly
altered by the subjects’ having had
another form of the test earlier
($\mu_1 = \mu_2$).


METHOD 


Experimental Design 


The experimental design used in this 
study is called a Type V Mixed Design by 
Lindquist (1953). With two forms of the 
test, two types of administration, and two 
possible orders of administration this de- 
sign made it possible for the effects of all 
three treatments to be evaluated as within- 
subject comparisons. Any given subject 
received both Form A and Form B, both 
an individual and a group administra- 
tion, and both a first and second adminis- 
tration. The increased precision obtained 
by using within-subjects rather than a com- 
bination between-subjects and within-sub- 
jects variability as an error term made it 
more likely that any differential effects of 
forms, treatments, or orders would be 
identified. 


                     2nd Administration
Form A    |  Class 4 (N = 15)  |  Class 2 (N = 15)
Form B    |  Class 3 (N = 15)  |  Class 1 (N = 15)
             Individual           Group
                     Administration

The experimental design might be dia- 
gramed as in Fig. 1. 


Subjects


The four groups of Ss called for in the
experimental design were selected from
the fifth grade of one of the Nashville City
Schools.* Although all members of the four
classes were tested, the data were analyzed
only for children of appropriate age for
grade placement (117 to 137 months in
the second month of the school year) and
of approximately normal IQ (Kuhlmann-Finch
IQ scores between 80 and 121). Since
it was not possible to constitute random
groups for the purposes of this experiment,
the selected Ss within each class became an
experimental group. While this procedure
was not considered ideal, the authors were
not able to identify relevant variables such
as age, sex, or IQ on which the groups
differed significantly. Assignment of students
to the classroom groups had been on
an essentially random basis at the beginning
of the year.

Preliminary screening of Ss on the basis
of age-grade placement and intelligence left
unequal numbers of Ss within the four subgroups.
To facilitate the analysis Ss were
eliminated at random until each group contained
the same number as the smallest
subgroup. In this way the number of cases
per group was reduced to 15. Table 1 provides
a summary of sex, IQ, and CA data
on the Ss in each experimental group.

* The cooperation of W. H. Oliver, Superintendent
of Schools, Rupert Klaus, School
Psychologist, along with the principal and
teachers involved in the study, is gratefully
acknowledged.


Procedure 


On the first day of the experiment six
examiners administered the appropriate
form of the test to individual Ss in the first
two groups. The same six examiners also
administered the appropriate form of the
test to the remaining two intact groups.
On the third day of the experiment the
same six examiners administered the alternate
form of the test to
those individuals who had been tested
in groups earlier. The Ss who were tested
individually on the first day were then
given the alternate form in a group presentation.
Individual tests were administered
to these children in the main room of
the school library, a small workroom which 
adjoins the library, the school clinic, or 
the private study of the school principal. 
Each child was brought into a testing room
and given the following instructions:

I am going to show you some pictures.
There are four pictures on each page: one,
two, three, four (pointing at the same time
to an appropriate picture on Plate No. Example
A). I will say a word, and I want you
to tell me the number of the picture that best
tells the meaning of the word.


The child was then presented with the three 
sample plates, Examples A, B, and C, and 
the proper form of response was estab- 
lished. The examiner then said: 


That’s fine. Now I’m going to show you
some more pictures. I’ll say a word, and I
want you to give me the number of the picture
that best tells the meaning of the word;
one, two, three, or four. As we get further
along, you’ll find that you may not be sure
of the meaning of the word. I want you to
look carefully at all four pictures anyway,
and make the best choice you can. Be sure
to look carefully at all the pictures.


TABLE 1
BACKGROUND DATA ON EXPERIMENTAL GROUPS

Group       IQ                CA (Months)        Sex
            Mean     s        Mean     s         M    F
1           97                                   6    9
2           97.07    10.86    128.53   6.05      9    6
3           101.73   9.91     125.73   4.98      7    8
4           97.73    10.35    126.13   5.01      7    8
Combined    98.50    10.19    126.33   5.18      29   31


As the test progressed, each child was given 
verbal encouragement and admonished 
several times to “look at all the pictures.” 
Testing was begun at the recommended 
beginning point (Plate No. 70) for each 
child. The plates were presented in ascend- 
ing order. If a child committed an error 
before he had achieved eight successful 
identifications, the examiner turned back 
to Plate No. 69, and the plates were pre- 
sented in descending order until a total of 
eight consecutive pictures were identified. 
Eight consecutive successful identifications 
constituted a basal point, and the child was 
assumed to be able to identify correctly 
any picture below that point. Once the 
basal point was established, testing was 
resumed at the point where the error was 
committed and proceeded until the child 
committed six errors out of any eight con- 
secutive presentations (chance perform- 
ance). At this point it was assumed that he 
would continue to perform at only chance 
level. 

The number of the plate where testing 
was terminated was recorded. The total 
number of errors committed was sub- 
tracted from this, and the remainder was 
the child’s raw score. 
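
The ceiling rule and the raw-score subtraction described above can be sketched as follows. This is an illustration under stated assumptions, not the published scoring routine: the function name `score_individual`, the response format, and the sample record are hypothetical, and the sketch assumes the basal has already been established so that only errors above it are counted.

```python
from collections import deque


def score_individual(responses, window=8, max_errors=6):
    """Score one ascending run of responses presented above the basal.

    responses: (plate_number, is_correct) pairs in the order presented.
    Returns (ceiling_plate, raw_score), or (None, None) if no ceiling occurs.
    """
    recent = deque(maxlen=window)  # error flags for the last eight plates
    total_errors = 0
    for plate, is_correct in responses:
        missed = not is_correct
        recent.append(missed)
        total_errors += int(missed)
        if sum(recent) >= max_errors:
            # Ceiling: six errors within eight consecutive presentations.
            # Raw score: terminal plate number minus total errors.
            return plate, plate - total_errors
    return None, None


# Hypothetical record: testing starts at Plate 70, with six misses among Plates 81-88.
missed_plates = {81, 83, 84, 86, 87, 88}
responses = [(plate, plate not in missed_plates) for plate in range(70, 95)]
ceiling, raw = score_individual(responses)
print(ceiling, raw)
```

In this made-up record the sixth error within eight consecutive plates falls on Plate 88, so testing stops there and the raw score is 88 minus 6 errors.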

Group Test Situation. The plates were 
shown to each of the four classes as a 
group. They were transferred to 35 mm. 
transparencies and projected onto a 60” × 60”
screen. One examiner operated the
projector; one called out the appropriate
words (each word was given twice); and 
the remaining four examiners served as 
proctors to insure that the children were 
starting at the proper place and that each 
understood the directions. Group testing 
was conducted in the following manner. 
The projector and screen were prepared 
and an examiner drew a facsimile of the 
answer sheet on the chalk board. He then 
said: 

I am going to show you some pictures.
They will appear here on this screen. There
will be four pictures on each slide; one, two,
three, and four (pointing to the four appropriate
positions). You see that there are three
blocks like this one (pointing to illustration
on chalkboard) on the half sheet of paper
attached to your answer sheets. Mrs. Brooks
will say a word; I want you to look at the
picture on the screen and place an X in the
block which shows the picture that best tells
the meaning of the words she says. Let’s try
some.


TABLE 2
PPVT PERFORMANCE OF SIXTY FIFTH-GRADE CHILDREN
UNDER EXPERIMENTAL CONDITIONS

                      Form              Administration     Order
Mean                  76.57   76.75     77.08   76.23      75.97   77.35
Standard Deviation    9.90    10.32     10.39   9.81       9.64    10.52


TABLE 3
ANALYSIS OF VARIANCE

Source                  df     SS           MS        F      F.05   F.10   F.20
Within Subjects                1,490.50
  Form (F)              1      1.01         1.01
  Administration (A)    1      21.67        21.67
  Order (O)             1      57.41
  F × A × O             1      5.21         5.21      .21    4.00   2.80   1.68
  Error (w)             56     1,405.20     25.09
Between Subjects        59     10,576.49
  F × A                 1
  F × O                 1      143.00       143.00    .81    4.00   2.80   1.68
  O × A                 1      147.41       147.41
  Error (b)             56     9,856.67     176.01
Total                   119    12,066.99


Examples A, B, and C were then presented,
and the proctors helped each child who
indicated lack of understanding of, or confusion
in, following the directions. When
it was ascertained that all children in the 
class understood the procedure, the ex- 
aminer said: 


Good, now I’m going to show you some 
more pictures, and Mrs. Brooks will say the 
words. I want you to look carefully at all 
four pictures and place an X over the posi- 
tion of the picture that best tells the mean- 
ing of the word Mrs. Brooks says. As we get 
further along, you will find that you aren't 
sure you know the word. But I want you to 
look carefully at all the pictures anyway and 
make the best choice you can. Remember to 
look at all the pictures. 


Testing was then begun with Plate No. 45 
and proceeded through Plate No. 120. 
Plates No. 45 through 69 were presented 
for a period of 10 sec.; Plates No. 70 
through 120 for 15 sec. From Plates No. 
80 through 120 the examiner admonished 
with each fifth plate: “Be sure to look at 
all the pictures” or “Be sure to look at all 
four pictures and make the best choice you 
can.” 

Scoring of group answer sheets was be- 
gun at Plate No. 70 and a basal established 
in the same manner as for individual test- 
ing; that is, scoring progressed upward 
from No. 70 to point of first error. If
eight successful identifications were made
before an error was committed, scoring continued
until six errors were committed out
of eight consecutive presentations. If an
error was committed before eight successful
identifications had been made, scoring
proceeded backward from point of first
error to point where eight consecutive successful
identifications were obtained, and
this point was considered basal. No errors
were counted below this point. Six errors
out of eight consecutive items established
the ceiling and no credit was given for 
successful identification beyond this point. 
The raw score for each child was computed 
in the same manner as the raw scores for 
individual presentations. 

No child completed the group tests with- 
out a ceiling, or terminal point, being 
established before Plate No. 120 was 
reached. It was necessary to establish basal 
points for six children by individual pres- 
entation. This was done on the following 
school day. No more than four individual 
plates had to be presented to establish 
these basal points. 

Testing Times. About 10 min. were re- 
quired for individual presentation, and ap- 
proximately 30 min. for group testing. 


ANALYSIS AND DISCUSSION 


Means and standard deviations for all 
60 Ss on each form, under each type of ad- 
ministration, and for each order were de- 
termined. These data are summarized in 
Table 2. 

From Table 3 it can be seen that neither form of
the test, type of administration, testing
order, nor any combination of these factors
produced a change in PPVT scores which
could not be attributed to chance at the
.05 level of significance. The obtained F
ratio for the Form A-Form B comparison 
was so small as to leave little doubt as to 
the equivalence of the two forms for the 
population represented by the group 
tested. Similarly, an F ratio greater than
that obtained in the individual-group administration
comparison would be obtained
more than 50% of the time by
random sampling alone; considerable confidence
in the equivalence of scores obtained
by individual and group methods
seems warranted.

While an F ratio greater than that ob- 
tained between first and second administra- 
tions of the PPVT could be expected be- 
tween 10 and 20% of the time by chance 
alone, the obtained difference of 1.38 raw 
score points was in the direction of a prac- 
tice effect. Consequently, it might be con- 
cluded that the practice effect, if any, 
amounted to only about one point. 
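
The marginal means in Table 2 can be related to the within-subjects sums of squares in Table 3. The sketch below is a consistency check under two assumptions that the text does not state outright: each marginal mean rests on 60 scores, and each main effect is tested against the Error (w) mean square. The names `ss_from_means` and `table2_means` are illustrative. For a two-level effect with n scores per level, the sum of squares is n times the squared mean difference, divided by 2.

```python
N_PER_LEVEL = 60         # assumed: each of the 60 Ss contributes one score per level
MS_ERROR_WITHIN = 25.09  # Error (w) mean square from Table 3


def ss_from_means(mean_a, mean_b, n=N_PER_LEVEL):
    """Sum of squares for a two-level effect with n scores per level."""
    return n * (mean_a - mean_b) ** 2 / 2


# Pairs of marginal means as printed in Table 2.
table2_means = {
    "Form": (76.57, 76.75),
    "Administration": (77.08, 76.23),
    "Order": (75.97, 77.35),
}

for effect, (m1, m2) in table2_means.items():
    ss = ss_from_means(m1, m2)
    print(effect, round(abs(m2 - m1), 2), round(ss, 2),
          round(ss / MS_ERROR_WITHIN, 2))
```

The sums of squares recovered this way differ slightly from the tabled 1.01, 21.67, and 57.41 because the means are rounded to two decimals. The Order difference reproduces the 1.38 raw score points cited in the text.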


CONCLUSIONS 


When both forms of the PPVT were 
administered in counterbalanced order to 
fifth-grade children of normal intelligence 
and in the proper school grade for their 
age, their average scores were not a func- 
tion of the form administered nor of hav- 
ing the test administered individually or 
in a group. Furthermore, any practice ef- 
fect which might have stemmed from hav- 
ing had the alternate form two days earlier 
averaged only about one point. 
SOURCES OF TENSION IN MARRIED AND SINGLE
WOMEN TEACHERS OF DIFFERENT AGES


It was the aim of this study to determine
among female teachers the age differences
in degree of conflict evident in
various areas of psychological adjustment. 
The various areas of psychological adjust- 
ment examined were those which previous 
studies (Peck, 1936; Symonds, 1943; 
Barker, 1948) have indicated are of some 
concern to teachers and school adminis- 
trators. These areas are: (a) Emotional 
Tendencies, (b) Heterosexual Relations, 
(c) Religion, (d) Vocational Outlook, 
(e) Social Acceptability, (f) Teacher- 
Supervisor Relationships, and (g) Phy- 
sical Health. 

The paucity of experimental data on ad- 
justments of teachers precludes the mak- 
ing of adequate generalizations on the ad- 
justment areas examined in this study. 
Actually there have been few studies of 
adults in general in the areas of adjust- 
ment investigated in this study (Hurlock, 
1953; Harsh & Schrickel, 1950; Young,
1947), though Havighurst (1953) has spec- 
ulated regarding the type of develop- 
mental tasks which characterize various 
age groups and which might be expected 
to represent areas of adjustment. 

Since married women are probably less 
dependent than single women on their 
jobs, it was hypothesized that they would 
be less likely to accept supervision, 
especially with increasing age. Many of 
the satisfactions of the married teachers 
in such areas of adjustment as Emotional 
Tendencies, Heterosexual Relations, Re- 
ligion, Social Acceptability, and Physical 
Health are derived from their homes and 
families, and perhaps augmented by their 
jobs. Thus, it was felt that little disturbance
should be evidenced in these areas.
The area of Vocational Outlook was also
considered to be nondisturbing, since it
was presumed that married women were
more likely to be employed in a field of
their own choice.

Unmarried teachers, on the other hand,
must rely completely on their jobs, and if
their choice of teaching as a career was a
poor one, they are “stuck” with it. Thus,
it was expected that they would show
disturbance in the areas of Vocational
Outlook and Teacher-Supervisor Relationships.
As marriage becomes less and less
likely with increasing age, it was hypothesized
that the areas of Heterosexual
Relations, Emotional Tendencies, Social
Acceptability, and Religion would show increasing
disturbance, as the single woman
tries to make other adjustments as a substitute
for marriage. Physical Health
should also be of great concern to the
single woman, since her income depends
on her being present to perform her job.
Generally, then, it was proposed that
single women would be more disturbed
in most areas at all age ranges than married
women.


PROCEDURE AND SUBJECTS


The Stimulus Material


Most studies of adjustment problems
have employed check lists or questionnaires.
In contrast, the present study
utilized responses and reaction times in
word association tests as indirect measures

shown that reaction time in response to
stimulus words, as well as the type of re- 
sponse given to a conflict word, differ 
from the reaction time and type of verbal 
response given to a neutral word. Thus, 
it seems reasonable to assume (a) that 
word symbols stand as surrogates for actual
experience in a given area of adjustment,
(b) that there will be a longer reaction
time to words in the area in which
the teacher is experiencing the greater de- 
gree of conflict, and (c) that the types 
of verbal response given a “conflict” word 
will differ from the types of verbal re- 
sponse given to a “neutral” word. 

Stimulus words having emotional as- 
sociations with each of the areas mentioned 
above, with the exception of Teacher- 
Supervisor Relationships and Physical 
Health, as well as “neutral” words, were 
selected from the word lists employed by 
Powell (1955). Additional words having 
emotional connotations in the areas of 
Teacher-Supervisor Relationships and 
Physical Health were selected from a list 
of stimulus words having possible emo- 
tional associations in these areas prepared 
with the assistance of undergraduate stu- 
dents in child psychology. This list of 
words was then checked against Rinsland’s 
Word List (1945), and only those words 
appearing for fifth graders were selected to 
be judged by 10 Education graduate students.*
The words finally selected to represent
the two critical areas mentioned above
were those among 75 surviving the above
procedure on which there was 100% agreement
among the 10 Education students
as to area of adjustment represented.

In its final arrangement for experimental
use, a practice series of five words
was given to each S at the beginning of
the test to clear up any misunderstandings
and to familiarize the S with the technique
to be followed. This was followed by the
initial neutral word series of seven words
to determine the average neutral reaction
time for each S. Finally, each critical list
of five words for the adjustment areas was
followed by a neutral list of four words.
These neutral words were inserted between
each critical word series to act as
a partial control for any “generalization”
effect which has been noted in previous
studies (the emotional tone aroused by the
critical stimulus words extending to the
subsequent reaction causing the appearance
of a “disturbance” where no disturbance
actually exists). However, this generalization
effect was not noted in Powell’s
study (1952) nor in this study.

* Rinsland’s list gives the frequency of use
of words at each grade level as determined
from various papers (compositions, theses,
etc.) done by children at each age level.

The critical word series (indicated by 
asterisks) and the intervening neutral
word series, together with the initial neu- 
tral word series and practice series were: 


Practice Series: White, Cat, Man, Black, Girl
Initial Neutral Series: Canary, Counter, Ladder, Lantern, Lighthouse, Shadow, Tower

*Emotional Tendencies: Worry, Afraid, Unhappy, Restless, Anxious
Neutral Series: Apple, Window, Minnow, Piano

*Heterosexual Relations: Dance, Kissing, Marriage, Dates, Hugging
Neutral Series: Second, Tree, Bench, Sand

*Religion: Church, God, Heaven, Worship, Prayer
Neutral Series: Leaves, Fender, Magazine, Sandwich

*Vocational Outlook: Wages, Money, Employed, Hire, Job
Neutral Series: Daylight, Photograph, Bean, Monday

*Social Acceptability: Popular, Friend, Lonely, Party, Unfriendly
Neutral Series: Cardboard, Banana, Harmonica, Door

*Teacher-Supervisor Relationships: Superior, Authority, Supervisor, Principal, Boss
Neutral Series: Sleep, Carpet, Hill, Floor

*Physical Health: Sick, Hospital, Invalid, Pain, Doctor
Neutral Series: Lamp, Elevator, Table, Balloon


TABLE 1
MEAN AGES IN YEARS OF EXPERIMENTAL POPULATION
(N = 25 in each group in each age range)

Age Range    Teachers
20-29        24.6    24.2
30-39        35.1    34.4
40-49        45.5    44.7
50-59        54.8    55.6


At the conclusion of the test the S was 
asked to give information as to date of 
birth and marital status. The entire test 
was administered in privacy in a room, 
well lighted, containing a table or desk 
and chairs along with the experimental 
apparatus. The reaction time apparatus 
used in this study and the procedure fol- 
lowed in administering the directions were 
the same as those used by Powell (1952). 


Subjects 


A total of 200 white female public 
school elementary teachers (100 married 
and 100 unmarried)* ranging in age from
20 to 59 years participated in this study. 
All Ss in the investigation were teaching 
in elementary public schools in the Greater 
Cleveland area at the time the study was 
made, and were selected so as to comprise 
samples of 25 married and 25 unmarried individuals
in each of four age ranges,
as follows: (a) 20-29, (b) 30-39,
(c) 40-49, and (d) 50-59. The mean
age for each of these groups in each age 
range is presented in Table 1. 

All Ss in this study were assured that
they would remain anonymous and that
results obtained for each individual would
be kept confidential. Ss in this study held
at least a bachelor’s degree from an accredited
college or university.

*For the purpose of this study, the unmarried
woman is one who has not been
married at any time before this study. The
married woman is one who was living with
her spouse at the time of this study. Thus,
this investigation excludes widows, divorcees,
and those otherwise separated from
their spouses.


Methodological Considerations for Analyz- 
ing Obtained Data 


The accuracy of the electric chronom- 


manipulating the switch which stopped 
the electric chronometer were checked on 
various occasions during the course of this 
investigation as described by Powell 
(1952). In addition, two independent ob- 
servers were employed to check the re- 
liability of the reaction time recorded. 
Each observer (O) used a separate switch 
to control his own chronometer. The re-
action times recorded by the O and the E
were compared for the purpose of obtain- 
ing the differences in reaction time. The 
results indicated a correlation of .90 be- 
tween O’s and E’s reaction time. 

A prime factor to be considered in re- 
action time (the interval between the in- 
stant the stimulus word is presented and 
the instant the response word is given by 
the S) is that each individual has a re-
action time that is peculiar to him. A
second factor to be considered is the pos-
sibility of changes in reaction time, per
se, with age, which has been demonstrated 
in a number of studies. For these reasons 
it seemed best to use each S as his own 
control in order to correct for any possible 
changes in gross reaction time with age, 
and to derive a mean difference score for 
each individual in each area of adjust- 
ment. 

The mean difference score was obtained
for each individual by determining each
individual’s average reaction time to the
initial “neutral word” list and then sub-
tracting this average from the individual’s
reaction time to each “critical” word in
each of the areas of adjustment. (For ex-
ample, if an individual’s average reaction
time to the initial “neutral” series was
1.96 seconds and his reaction time to a
“critical” word was 2.76 seconds, the dif-
ference score was 0.80 seconds.) The aver-
age of the five difference scores in each
critical area was the mean reaction-time
score used in the statistical analyses in
this investigation.
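
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration: the function name and the reaction times are hypothetical and are not data from the study; the neutral times were chosen so that their mean equals the 1.96 seconds of the worked example.

```python
from statistics import mean


def mean_difference_score(neutral_times, critical_times):
    """Average of (critical RT minus neutral baseline) over one adjustment area."""
    # Each S serves as her own control: the baseline is her own neutral mean.
    baseline = mean(neutral_times)
    differences = [rt - baseline for rt in critical_times]
    return mean(differences)


# Hypothetical reaction times in seconds for one S (not data from the study).
neutral = [1.90, 2.00, 1.95, 1.99]          # initial neutral series
critical = [2.76, 2.50, 2.10, 2.30, 2.64]   # five critical words in one area

score = mean_difference_score(neutral, critical)
print(round(score, 2))
```

The first critical word reproduces the example in the text (2.76 - 1.96 = 0.80 seconds); the area score is the mean of the five such differences.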

Difference scores were computed for the 
married group and the unmarried group 
at each of the various age ranges. It was 
hypothesized that as the degree of con- 
flict in the various areas of psychological 
adjustment increased there would be a 
slower reaction time to the “critical” 
stimulus words in these various areas. The 
Fisher-Behrens d test (1939) was em- 
ployed to determine whether the increase 
in reaction time is greater than that which 
could be accounted for by sampling errors 
from a population of such difference scores. 

The d test was used in this investigation 
rather than analysis of variance, since the 
age ranges at which significant differences 
in adjustment should have appeared were 
predicted before the collection of data for 
this study, and the d test was considered 
to be a precise test of the differences 
postulated. With this test it is not neces- 
sary to assume that the two samples come 
from populations having the same vari-
ance. The formula used to determine d
in this study is the raw score formula pre-
sented by Johnson (1949), and significance
was evaluated with the level of confidence
tables derived by Sukhatme (1938).
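
As an illustration of the d test, the sketch below computes d from group means and standard deviations. The raw score formula itself is not reproduced in this report, so the code assumes the usual form of the Fisher-Behrens statistic, d = (M1 - M2) / sqrt(s1²/n1 + s2²/n2), together with the angle θ (tan θ = (s1/√n1) / (s2/√n2)) by which tables of this kind are generally entered. The function name is hypothetical, and the standard deviations are taken as printed in Table 3.

```python
import math


def fisher_behrens_d(mean1, sd1, n1, mean2, sd2, n2):
    """Return (d, theta in degrees) for two samples with unequal variances.

    Assumed form: d = (mean1 - mean2) / sqrt(sd1^2/n1 + sd2^2/n2).
    """
    se1_sq = sd1 ** 2 / n1   # squared standard error of the first mean
    se2_sq = sd2 ** 2 / n2   # squared standard error of the second mean
    d = (mean1 - mean2) / math.sqrt(se1_sq + se2_sq)
    theta = math.degrees(math.atan2(math.sqrt(se1_sq), math.sqrt(se2_sq)))
    return d, theta


# Emotional Tendencies, ages 20-29 (Table 3): unmarried vs. married, N = 25 each.
d, theta = fisher_behrens_d(1.196, 0.619, 25, 0.625, 0.395, 25)
print(round(d, 2), round(theta, 1))
```

Under these assumptions the computed d is about 3.89, in line with the 3.890 reported for this comparison in Table 4. No equal-variance assumption enters the calculation, which is the property the text cites as the reason for preferring this test.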


Qualitative Word Analysis 


For a qualitative analysis the words 
given by the Ss with reference to the 
stimulus words were classified into six 
categories: 


Equivalent: Definition of the stimulus word,
including synonyms

Logical: Any logical relation to the stimu-
lus word, including antonyms

Personal: Any association of a personal
type, including names, likes, or dis-
likes

Perseverative: Any repetition in series.
Same word repeated regardless of stimu-
lus word

Irrelevant: No relation to the stimulus
word

No response


There was 91% agreement among three
judges and the E as to proper categoriza-
tion.

In the final analysis, the verbal re- 
sponses which were made by the Ss were 
placed into two categories—those con- 
sidered “nondisturbed,” which included 
equivalent plus logical and those con- 
sidered “disturbed,” which included per- 
sonal, perseverative, irrelevant, and no re- 
sponses. Any finer breakdown seemed 
inadvisable as reported by Powell (1952) 
in his attempt to categorize responses in 
his study. 
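
The two-way grouping can be sketched as follows. The category labels follow the six categories listed above; the function name and the list of judged responses are hypothetical and are not data from the study.

```python
NONDISTURBED = {"equivalent", "logical"}
DISTURBED = {"personal", "perseverative", "irrelevant", "no response"}


def percent_disturbed(categories):
    """Percentage of judged responses that fall in the 'disturbed' class."""
    for category in categories:
        if category not in NONDISTURBED | DISTURBED:
            raise ValueError(f"unknown category: {category}")
    disturbed = sum(1 for category in categories if category in DISTURBED)
    return 100.0 * disturbed / len(categories)


# Hypothetical judged categories for 20 responses (not data from the study).
judged = ["equivalent"] * 9 + ["logical"] * 8 + ["personal", "irrelevant", "no response"]
print(percent_disturbed(judged))
```

Percentages of this kind, computed separately for each marital group, age range, and adjustment area, are what a table of "disturbed" responses would report.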


RESULTS AND DISCUSSION


An analysis of the mean reaction time
scores to the initial neutral words was
made before attempting to test for age
and marital differences in the various areas
of psychological adjustment. Means and
standard deviations for the several age and
marital groups are contained in Table 2.
Applying the d test to the difference
between married and unmarried groups
at the four successive age ranges yielded
d’s of 1.028, 1.612, 1.463, and 0.324,
respectively. None of these was of sufficient
magnitude to warrant rejection of the
null hypothesis.
Comparison of married individuals from
one age group to the next (20-29 vs. 30-
39; 30-39 vs. 40-49; 40-49 vs. 50-59)
yielded d’s of 1.438, 2.918, and 0.865, re-
spectively. Only the second (30-39 vs.
40-49) was sufficiently large to reject the
null hypothesis, and this at the .01 level
of confidence. Similar d’s in the case of the
unmarried were 1.205, 0.058, and 0.306,
none of which was large enough to warrant
rejection of the null hypothesis.


TABLE 2


MEANS AND STANDARD DEVIATIONS OF
REACTION TIME SCORES IN SECONDS OF
MARRIED AND UNMARRIED TEACHERS
AT VARIOUS AGE RANGES TO INITIAL
NEUTRAL WORDS


(N = 25 in each group in each age range)


Age

20-29
30-39
40-49
50-59


It is of interest that age and marital
differences in reaction time were negligible,
even though the “difference scores” uti-
lized in other comparisons removed the in-
fluence of such differences in the com-
parisons to be turned to next.


FIG. 1. Mean reaction time scores in seconds of married and unmarried teachers at various
age ranges to initial neutral words, and mean reaction time difference scores in seconds of mar-
ried and unmarried teachers at various age ranges to “critical” words taken from various psy-
chological adjustment areas.


Mean reaction time difference scores to
the “critical” words pertaining to the vari-
ous psychological adjustment areas for
the various age and marital groups are
presented in graphic form in Fig. 1 in
order to show the contrast visually. The
means, standard deviations, and the results
of the d tests are presented in Tables 3,
4, and 5.

Figure 1 shows a striking contrast be- 
tween the married and unmarried women 
in their responses to words relating to 
Emotional Tendencies, with the unmarried 
responding so as to suggest greater emo- 
tional disturbance. As the d’s in Table 4 
indicate, the differences were significant 
at all ages except 40 to 49. Although the 
general trend for both groups with in-
creased age was in the direction of greater
emotional disturbance, when the adjacent
age groups were compared only the dif-
ference between the 30-39 and 40-49
groups in the case of the married women
was significant. The greater general emo-
tional disturbance on the part of the single
women may be explained either in terms
of selection (the more disturbed being less
likely to marry) or in terms of general
frustration arising from their continued
single state. Other
studies suggest that marriage and children 
constitute one of the major sources of life 
satisfaction among women, and it is pos- 
sible that frustration in this area might 
increase general emotionality especially in 
the 30’s when a reorientation of goals 
(from marriage to career) is taking place. 

That heterosexuality does contribute a 


TABLE 3


MEANS AND STANDARD DEVIATIONS OF THE DIFFERENCE SCORES OF THE VARIOUS AREAS
OF PSYCHOLOGICAL ADJUSTMENT


                            20-29                   30-39                   40-49                   50-59
                            Mean        SD          Mean        SD          Mean        SD          Mean        SD

Emotional Tendencies
  Married                   .625        .395        .509        .245        .882        .629        .838        .583
  Unmarried                 1.196       .619        1.711       1.343       1.512       1.463       1.602       1.176
Heterosexual Relations
  Married                   .695        .330        .487        .297        .574        .331        .626        .432
  Unmarried                 2.022       1.081       2.075       2.065       1.226       .901        .877        .433
Religion
  Married                   [illegible] .434        .360        .318        .655        .503        .917        .800
  Unmarried                 .614        .367        .806        .348        1.080       1.098       1.150       .940
Vocational Outlook
  Married                   .563        .498        .515        .461        .609        .388        .696        .486
  Unmarried                 .857        .625        1.163       .594        1.076       .998        .858        .590
Social Acceptability
  Married                   .543        .380        .575        .272        .628        .343        .966        1.002
  Unmarried                 1.486       1.176       1.780       1.185       1.578       .788        1.318       .948
Teacher-Supervisor Relations
  Married                   1.264       .734        .978        .476        1.889       1.873       1.795       1.223
  Unmarried                 1.177       .698        1.264       .651        1.289       .890        1.540       .781
Physical Health
  Married                   .364        .375        .349        [illegible] .514        .381        .676        [illegible]
  Unmarried                 .689        .594        .899        .871        .848        .627        1.116       [illegible]
TABLE 4


d TEST OF STATISTICAL SIGNIFICANCE OF DIFFERENCES IN THE DIFFERENCE SCORES
BETWEEN MARRIED AND UNMARRIED FEMALES AT VARIOUS AGE LEVELS TO WORDS
DRAWN FROM THE VARIOUS PSYCHOLOGICAL ADJUSTMENT AREAS
(VALUES OF FISHER-BEHRENS d)


                                  Age Ranges in Years
Adjustment Area                   20-29       30-39       40-49       50-59

Emotional Tendencies              3.890**     4.405**     1.979       2.912**
Heterosexual Relations            5.869**     3.805**     3.396**     2.051
Religion                          .563        4.728**     1.760       .943
Vocational Outlook                1.839       4.309**     2.180*      1.060
Social Acceptability              3.815**     4.955**     5.526**     1.276
Teacher-Supervisor Relationships  .429        1.774       1.447       [illegible]
Physical Health                   2.315*      2.971**     2.575*      2.304*

* .05 level of confidence.
** .01 level of confidence.
TABLE 5


d TEST OF STATISTICAL SIGNIFICANCE OF AGE RANGE DIFFERENCES IN THE DIFFERENCE
SCORES IN THE VARIOUS AREAS OF PSYCHOLOGICAL ADJUSTMENT, BY MARITAL
STATUS (VALUES OF FISHER-BEHRENS d)


                                                  20-29       30-39       40-49
Stimulus Area                     Marital Status  vs.         vs.         vs.
                                                  30-39       40-49       50-59

Emotional Tendencies              Married         1.249       2.763**     .257
                                  Unmarried       1.742       [illegible] .240
Heterosexual Relations            Married         2.343*      .978        .478
                                  Unmarried       .114        1.884       1.745
Religion                          Married         1.764       2.477*      1.396
                                  Unmarried       1.899       1.190       2.421*
Vocational Outlook                Married         .354        .780        .700
                                  Unmarried       1.774       [illegible] .940
Social Acceptability              Married         .342        .605        1.596
                                  Unmarried       .881        .710        1.054
Teacher-Supervisor Relationships  Married         1.635       2.357**     .210
                                  Unmarried       .456        [illegible] 1.060
Physical Health                   Married         .153        1.669       1.244
                                  Unmarried       .996        [illegible] 1.368

* .05 level of confidence.
** .01 level of confidence.

major source of tension for single women,
especially in the younger adult years, is
clearly evident in the chart. At all ages
except the oldest, the differences between
the married and unmarried are highly sig-
nificant. In comparisons of adjacent age
groups, however, only the 30-39 vs. 40-
49 difference in the case of single women
is reliable, though the trend is obvious.
These findings may be interpreted as sug-
gesting that the opportunity for hetero-
sexual relations in marriage reduces this
as a tension area for married women, while
an eventual turning to a career may serve
to remove heterosexuality from a central
position and thus reduce it as a source of
tension.

Religion, contrary to our hypothesis,
also constitutes a source of tension for
both groups with the greatest difference
between the marital status groups oc-
curring at age 30-39 with the unmarried
women showing significantly greater ten-
sion. At the adjacent ages of 30-39 vs.
40-49 married women showed significantly
greater disturbance at the 40-49 level,
while for single women the greatest dis-
turbance appeared at the 40-49 vs. 50-
59 year levels with the 50-59 level being
the highest. It is difficult to discuss these
findings, since the reasons for religious dis-
turbance are not so readily apparent as
in the case of the other areas in this study.
Possibly the greater disturbance of 40-49-
year-old married women is associated with
raising children and orienting them to re-
ligion. For the single 30-39-year-old, it
may represent an attempt to find a sub-
stitute for marriage or heterosexuality,
while the 50-59 yr. disturbance may indi-
cate a last major attempt to find in re-
ligion a source of satisfaction.

The results in the area of Vocational Outlook
indicate that single women are signifi-
cantly more disturbed about vocations
at the 30-39 and 40-49 yr. levels than
the married women are. Adjacent age
group data demonstrate no significant dif-
ferences in disturbance. Since the job is
usually secondary to family life for the
married woman, her lack of concern in
this area is easily understood. To the single
woman the job is all important, particu-
larly during the 30’s and 40’s when mar-
riage becomes increasingly remote. Fur-
ther, it is during the 30’s that the job
becomes a necessity, even if it is a job
which is not satisfying to the individual,
but one in which she must remain for
economic reasons.

Figure 1 shows striking differences be-
tween married and unmarried women in
the responses to words relating to Social 
Acceptability with the differences being 
very significant at all ages except the last. 
The adjacent age data show no significant 
differences between age levels for either 
group. Since marriage is in itself a form 
of social acceptance, this finding is not 
surprising. Apparently, the single female 
continues to strive for social acceptance 
well into her 40’s, but by 50-59 either has 
found friends or has given up her search. 

The area of Teacher-Supervisor Rela-
tionships offered results somewhat con-
trary to those hypothesized. There were no
significant differences between the married
and unmarried women at the age ranges 
studied. The only significant difference in 
the adjacent ranges was between 30-39 
vs. 40-49 for married women with the dis- 
turbance greatest at the 40-49 yr. level. 
It had been expected that single women 
would have more conflict since they needed 
the job more. Apparently, however, neither 
group has any serious difficulty in this area 
except for the age mentioned above. At 
40-49, the married women may feel more 
independent about their jobs, and may be 
more willing to resist authority, or may in 
fact be striving to reach a position of more 
authority. 

The data indicate that single women are 
significantly more concerned about Physi- 
cal Health at all ages studied, but there 
were no significant adjacent age differences 
for either group. This greater concern 
demonstrated by single women is probably 
due to their need to care for themselves. 
Where the married woman has others 
around to care for her, the single woman 
is alone. Concern about physical health on
the part of the single woman may also re-
late to fear that illness may bring about
loss of pay, which she can poorly afford.


RESULTS OF THE QUALITATIVE
WORD ANALYSES


The results of the qualitative analysis
of the verbal responses to “critical” stimu-
lus words related to the various areas of
psychological adjustment do not substanti-
ate the hypotheses as do the results of the
reaction time study technique described
earlier. However, certain trends may be
apparent from an inspection of Table 6
in which the percentages of “disturbed”
responses to “critical” words related to the
various areas of psychological adjustment
are presented. Since no significant differ-
ences were found, no conclusion can be
reached.


MARVIN POWELL AND CHARLES D. FERRARO


TABLE 6


PERCENTAGES OF “DISTURBED” RESPONSES OF MARRIED (M) AND UNMARRIED (U)
TEACHERS AT VARIOUS AGE RANGES TO “CRITICAL” WORDS IN THE VARIOUS
AREAS OF PSYCHOLOGICAL ADJUSTMENT


                                  Age Range
Adjustment Area                   20-29                   30-39                   40-49                   50-59
                                  M           U           M           U           M           U           M           U

Emotional Tendencies              7%          12%         8%          13%         12%         15%         14%         16%
Heterosexual Relations            2           7           6           15          10          16          3           [illegible]
Religion                          6           11          3           9           2           7           8           4
Vocational Outlook                11          [illegible] 6           5           4           10          4           7
Social Acceptability              10          15          15          18          12          18          12          19
Teacher-Supervisor Relationships  6           6           6           9           10          6           5           6
Physical Health                   4           6           3           6           6           7           3           [illegible]


SUMMARY


Reaction times of 25 single and 25 
married elementary school teachers in each 
of four age groups (20-29, 30-39, 40-49, 
50-59) were obtained to stimulus words 
representing seven areas of psychological 
adjustment. Differences between the mar- 
ried and single were especially large in the 
areas of Emotional Tendencies, Hetero- 
sexual Relations, Social Acceptability, and 
Physical Health, with the single women’s 
responses reflecting greater tension in 
each of these areas. In two other areas 
marital status differences were reliable for 
only one or two age groups: Religion (30- 
39), Vocational Outlook (30-39, 40-49). 
In the remaining area (Teacher-Super-
visor Relationships) no difference ap-
peared at any age level. Except in the
last instance, wherever differences ap-
peared the single group’s responses re-
flected more tension. Age trends were not
clear-cut, but a trend toward greater ten-
sion with increasing age was apparent for
both marital groups in Emotional Ten-
dencies, Religion, Teacher-Supervisor Re-
lationships, and Physical Health. A sharp
drop among single women from the 30-39
age level onward was apparent in the case
of Heterosexual Relations as a tension
area.

These findings serve to identify sources
of tension characterizing teacher personnel
(and presumably other employed women)
and thus are of interest to teachers them-
selves as well as to others concerned with
the mental hygiene of teachers. Of special
concern to administrators should be the
finding that overall (ages and marital
groups combined) the area of Teacher-
Supervisor Relationships appeared to be
a major source of tension among the sev-
eral adjustment areas studied.